
3 0 APR im 



JVlPO 

aft. 



PCX 



UNITED STATES DEPARTMENT OF COMMERCE 
United States Patent and Trademark Office 

April 26, 2004 

TfflS IS TO CERTIFY THAT ANNEXED HERETO IS A TRUE COPY FROM 
THE RECORDS OF THE UNITED STATES PATENT AND T RADEMARK 
OFFICE OF THOSE PAPERS OF THE BELOW IDENTIFIED PATENT 
APPLICATION THAT MET THE REQUIREMENTS TO BE GRANTED A 
FOJD^iGDATE. 

APPLICATION NUMBER: 60/445,539 
FILING DATE: February 06, 2003 

RELATED PCT APPLICATION NUMBER: PCrAJS04/03545 





By Authority of the 
OMMISSIONER OF PATENTS AND TRADEMARKS 



M. SIAS 
Certifymg Officer 



PRIORITY 
DOCUMENT 

SUBMITTED OR TRANSMITTED IN 
COMPLIANCE WITH RULE 17.1(a) OR (b) 



CD 

f;; 



PROVISIONAL APPUCA TION CO VER SHEET 




CU 

o • 

!S » This is a request for filing a PROVISIONAL APPLICATION under 37 CFR 1 , 53(b)(2). 





Docket Number 



14.03P 



Type a plus sign ( + ) 
tnslde this box - 



LAST NAME 



RAohnen 
Sterling 
Doong 
Kollf 



INVENTOR(s) APPLICANT(S) 



FIRST NAME 



Debra 



Ron 

Venkata 



MIDDLE 
INITIAL 



D 
L 

S.K. 



RESIDENCE (CITY. AND EITHER STATE 
OR FOREIGN COUNTRY) 



Athens. GA 
Athens. GA 
Athens, OA 
Ouluth, OA 



TITLE OF THE INVENTION (280 characters max) 



DISCOVERY OF A GENE ENCODING AN.EXPRESSED ALPHA-1.4-GALACTURONbSYLTRANSFERASE IN 
ARABiDOPSiS THALIANA AND OF A FAMILY OF PROPOSED GALACTURONOSYLTRANSFERASES. 
AND THEIR GENES. IN ARABiQOPSiS THAUANA AND PLANTS 



CORRESPONDENCE ADDRESS 



iiilM 

23713 

PATENT TRADEMARK OFHCE 



GREENLEE. WINNER AND SULLIVAN. P.O. 
5370 Manhattan Circle. Suite 201 
Boulder/CO 80303 USA 
Phone: (303) 49d>8080; Fax (303) 499*8089 
.Email: winner@Greenwin.com 



ENCLOSED APPUCATION PARTS {check all that apply! 



^ Speciftcation Parts 18i II _60 Number of Pages 
. Drawing(s) Number of Sheets X 



X Applicants Claim Small Entity Status. See 37 CFR 1 .27 



Other (specify) Appendices I & II 41 pages 



METHOD OF PAYMENT Yc^gcA ona} 



A ehedc or money order is enclosed to cover the Provldonal filing feefl. 
•The Commissioner Is hereby outhorized to charge fnbig fee and credit Deposit Account Number: 07-1869 
The Qommissioner Is hereby authorized to charge any defldeney or credit any overpayment to Oepesit Account 07-1989 



PROVISIONAL FILING FEE AMOUNT $80:00 



The invention was made by an agency of the United States Goverrtment or under a conjtract with an agency of the United States 
Government. 

I . No jt_ Yes, the name of the U.S. Government agency and the' Gpvemment contract number are: 

USDA 98-35304-6772 and USDA 2001-03351 

Respectfully submitted, 



SIGNATURE Heeja Yoo-Warren 



REGISTRATION NO. 45,495 



•DATE: 



February .6, 2003 




PROVISIONAL PATENT APPLICATION 



Debra Mohnen 
Jason Divight Sterling . 
Ron Lou Doong 
Venkata Siva Knmar Kolli 



DISCOVERY OF A GENE ENCODING AN EXPRESSED ALPHA- 
1,4-GALACTURONOSYLTRANSFERASE IN ARABIDOPSIS 
TilALMA^A AND OF A FAMILY OF PROPOSED 
GALACTURONOSYLTRANSFERASES, AND THEIR GENES, IN 
ARABWOPSISTHALIAJ^A AND ¥1. ANTS 



CERTIFICATE OF MAILING 

I Iiereby certify diat this correspondence 
is bdug deposited with the United States 
Postal Service as "Bxpress Mail" in an 
envelope addressedto die 
Conunissioner for Patents. 



Date • ff 



loo3 6. 



Date • ^ .B. Kroge 

Express MaU Receipt EL 946 759 260 US 



Prepared by: 

GREENLEE, WINNER AND SULLIVAN, F.C. 

5370 Manhattan Circle 
Suite 201 
Boulder, Colorado 80303 
(303) 499-8080 
FAX: (303) 499-8089 



Attorney Dodcet No. 14-03P 



li • ■» 



14-03P 

\ PARTI 

. Role of Galacturonosyltransf erase i(GAlATl) in Pectin Synthesis 

Summary 

Pectin is a complex! family of cell wall polysaccharides that have multiple roles in plant growth, 
development and disease resistance. Pectins are also gelling and stabilizing agents in the food 
industiy, and rieutraceuticals and potential pharmaceuticals with beneficial effects on human 
health. ' We recently identified the- .first gene (GALATl) for a pectin biosynthetic 
glycosyltransferase: Arabidopsis thaliam a-i,4-galacturonosyltransferase I (GALATl); The 
specific aims of this proposal are to: (1) Characterize heterologously expressed GALATl for 
kinetic constants, substrate and acceptor specificity, pjH optima, thermal stability, 
quaternary structure, and post-translatipn glycosylation. These characteristics will aid in 
detennining the role of GALATl in honiogalacturonan synthesis, possible role in 
rhanrnogalacturonan I and/or rhainnogalacmronan n synthesis, and provide enzyme kinetic and 
specificity data for use in deciphering enzyme and regulatory mechanisms. (2) Produce a series 
of mutated GALATl proteins by heterologous expression of site-directed mutated GALATl 
genes and characterize the heterologously expressed mutated proteins. Characterization of 
the kinetic constants of the mutated enzymes will facilitate the identification of amino acids 
important for enzyme activity and substrate/acceptor binding. (3) Generate and use anti- 
GALATl antibodies for iraanunocytochemistry analysis of the sut>cellular location of 
GALATl. These experiments will establish in which region of the Golgi GALATl is located 
and allow the development, of models for the compartmentalization of pectin synthesis within the 
endomembrane system. (4) Use RNAi to generate GALATl knockouts and characterize the 
phenotype of the mutated plants. The long-term goal is to understand the role of GALATl in 
pectin synthesis and use this knowledge to produce plants with improved agronomical value. 



SPiBOOgiC AIMS 

The goals of this proposal are to biochemically characterize GALATl: the first pectin 
biosynthetic glycosyltransferase, and first galacturonosyltransferase from any ^ecies, to be at 
identified at the gene level. Our preliminaiy data suggest that the protein encoded by GALATl 
(GALATl), is the same al .4-galacturonosyltransferase that has been shown by us^*^ and others 

to transfer galacturonic apid (GalA) fromUDP-GalA onto homogalactu'ronan (HGA) (Fig. 1) 
and/or commercially available pectin acceptors. Thus, we propose that GALATl encodes UDP^ • 
galacturonic acidrHGA al.4-gaaacturonosyltransferase (i.e. GALATl m Table I). However, 
whether GALATl adds GalA only onto HGA, or can also add GalA to the backbone of the other 
pectic polysaccharides rhamnogalacturonan I (RG-I) (Fig. 2) or rhamnogalacturonan n (RG-II) 
(Fig. 3) is not known. Answers to these questions arc important to identify* which of the 4-9 
different GalATs required for pectin synthesis (Table I) is encoded by GALATL Furthermore, 
since GALATl is the first galacturonosyltransfisrase gene to be identified from any species,* 
nothing is known about the regions of the protein involved in subistrate or acceptor binding, nor 
are the catalytic amino acids known. The above questions will be addressed by Specific Aims 1 



ahd2 w hich are to characterize the kinetics and substrate specificity of native GALATl 
(Aim 1), and modified GALATl generated by site-directedl mutagenesis (Aim 2) in order to 
define the regions of the enzyme important in catalysis and substrate binding and to 
detenmine the specificity of GALAT for HGA^ RG-I and RG-II synthesis* The enzymes will 
also be analyzed for pH optima and thenrial stability, to compare GALATl with the previously 
published Gal ATs (Table II), and possible quaternary structure and post-translation 
glycosylation. The results of the kinetic analyses will indicate whether the apparent biphasic 
kinetics obtained for GalAT in membrane bound and solubilized plant fractions (see below) are a 
characteristic of GALATl and, if so, whether the kinetics represent allosteric regulation of the 
enzyme. Specific Aim 3 i s to determine the subcellular location of GALATl via 
inununocytochemistry using anti-GALATl antibodies to be generated. We previously 
showed that GalAT activity is membrane bound and localized to the lumen of the Golgi, and 
thus, proposed that the HGA biosynthetic (MAlT is a Golgi resident enzyme with its catalytic site 
in the Golgi lumen. The predicted topology of GALATl encoded by GALATl is completely 
consistent with these results since it is predicted to be a Type II meml>rane protein?^ with a an N- 
terminal cytosol facing region, a single transmeifnbrane-spanning region and a larger C-tenninal 
region. Location in the Golgi is also consistent with prior autoradiographic^^'^,, 
immunocytochemical^**^ and subcellular fractionation''-^'^^^ results showing diat pectin synthesis 
occurs in the Golgi. However, the distribution of the different pectiii biosynthetic enzymes 
within the Golgi complex is not known. Sudi information is critical to detemune which of the 
expected 72 predicted pectin biosynthetic enzymes' (see also Appendix 1) interact together to 
synthesize pecdn. The proposed inununocytochemistry study would be the first information on 
the subcellular location of any pectin biosyndietic glycosyltransferase and would allow 
formulation of models for how pectin is synthesized within the Golgi. The antibodies to be 
generated in Specific Aim 3 would also be used to determine if GalAT is a multiineric protein, 
and if so, to establish its ^toichiometry. SpedilcAim4 is to generate GALATl knockouts 
throi^ RNAi to gain information about the rbfe of GALATl in vivo. The results of these 
experiments will be most meaningful if viable plants that express no GALATl are obtained, and 
if such plants show changes in pectin stracturc. The resulting changes should represent the 
accumulation of GALATl precursors and an absence of the pectin normally generated by 
GALATl. If time allows, additibiial experiments will also be initiated to use the anti-GALATl 
antibodies generated in Specific Aim 3 as -'binding partners" to identify Arabidopsis proteins that 
bind to GALATl using iimnunoabsorption, followed by mass spectrometry sequencing of the 
iproteinsl This ^proach would allow us to begin to identify members of a predicted GALATl 
enzyme complex. A second line of research that would be initiated if time and resources allow is 
"to attempt toobtain 3D structural information on GALATl via X-ray crystallography and 
possibly NRM spectroscopy- Oux long term goal is to identify the lanzymes involved in pectin 
-synthesis, concentrating initially on HGA synthesis, and to decipher the process by which these 
enzymes work together to synthesize the complicated family of cell wall polysaccharides kno^yn 
as pectin. ...... - 
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HGA is synthesized first, followed by RG-I and RG-EL The numbers were given' to facflitate a comparison of the 
enzymes, but final numbering will likely correspond to the order in which the genes are identified. 
^HGA: homogalactiironan; RG-I: Rhan:inogalactiironan I; RG-II: Rhanuiogalacturonan 11; XGA: XylogalacturOnan; 
AGA; Apiogalacturonan 

^All sugars are D sugars, and have . pyranose . rings unless otherwise indicated. Glycosyitranferases 
add Co the glycqsyl residue on the left* of the indicated acceptor. 

''The ? means the designated Gal AT may be required if a different Gal AT in the list does not perform the 

designated function ' ^ 

^Glycosybesidue in the parenthesis is branched off the first GalA 
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figure 1- THmmnglon ofhomogatactaromat (HGA). 
HGA is alinearbomopolyin6rofa-1.4-lin]cedgalacturonic 
add that may be inethytesterfied at <Z6 and acetylated at 02 or . 
03. Subsdnited^^tacturoiians* such as KOAU apiogaUisiuniiiaB 
and xylog^acaiienan have an HGA backbone. - 
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Ffgnre 3. RepresenatiYe structure for rhamnotalotturonan // (RG-II). RG-n has a 
backbone of 1,4-linked a'D-CalfpA fcsidues. GdA residues are also present io RG-II - 
^dschauiB. 



Figure 2. SMprtttnalhm structure of rtiamnogqSaeturonan i (RG'i). 
RO-I has on dteniados |-r^)-a4>-Oa^A-<1-:»2>- a-t^Rhap-(l-*] 
bocKbone in wUcb roaciAy 20-80% of the rhamnoses aiB-snbsiitniEd by. 



V 
1 

BACKGROUND 

Pectin Structure & Function 

Pectin is the most complex polysaccharide in the plant cell walL It is comprises 30-40% 
of the primary wall of dicots and non-graminacepius raonocots, and - 10% of the primary wall in 
the grass family. Pectins are a family of polysaccharides'**'^^ that include homogalacturonan 
(HQA) (Fig. 1), rhamnogalacturonan-I (RG-I) (Fig. 2) and rhamnogalactiironan H (RG-U) (Fig. 
3) as well xylogalacturonaris QLGK?^^^\xsA apiogalacturonansS^-^ (see Appendix I). While the 
specific stmcture of each of these polysaccharides differs (Figs. 1-3), they are grouped into one 
family since tliey appear to be linked to each other in the wall and they each contain a-D- 
galacturonic add connected in a 1,4-linkage. * • 

HGA is the m6st abundant pectic polysaccharide, accounting for -55%-70% of pectiii^^. 
HGA is a linear .honiopolymer of al,4-linked Drgalactosyluronic acid that is partially . 
methyjesterified at the C6 carboxyl group and may be partially acetylated at. 0-2 and/or 0-3^ 
(Fig. 1). Some plants also contain H(5A that is substituted at the 2 or 3 position by D- 
apiofuranose^ the so-called ^iogalacturonans (AGA)^'*^^ and/or HGA that is substituted at tfie 3 
position with D-xylose^^"^ , so-called xylogalacturonan (XGA). RG-II is a complex 
jpolysaccharide that accounts for approximately. l6rll% of pectin^'^'. RG-II has an HGA 
backbone with four structurally complex iside chains attached to C-2 and/or C-3 of the GalA^*^^ 
(Fig, 3)..Rhamnogalactaronan I (RG-D account^ for 20-35% of pectin^^ (Fig. 3). RG-I is a family 
of polysaccharides with an alternating [->4)-a-D-GalA-(l— >2)- a-L-Rha-(I->] backbone in " 
which roughly 20-80% Of the riiamnoses are substituted by arabinan, galactan, or aiabinogalactan 
side.branches^*'^'\ 

• Pectins are believed to have multiple roles during plant growth, development, and in plant 
defense responses. For example, peCtlc polysaccharides play essential roles in cell wall 
strucfure"*^, cell adhesion"*^ and cell signaling'*^*'*^ Pectins also appear to mediate pollen tube 
growth"^^ and to have roles during seed^hydration''^'*^, leaf abscission^^ water movement^^ and 
fruit development"^*^. Oligosaccharides cleaved from pectin also serve as signals to induce plant 
defense responses^^"^^. Studies of mutant plants with altered wall pectin reveal that modifications 
* of pectin .stmcture leads to dwarfed plants"^^, brittle leaves'^, reduced numbers of side shoots and 
flowers^*, malformed stomata"^ and reduced cell adhesion^^. 

Although pectins appear to have multiple roles in plants, in no case has their specific 
mechanism of action been determined. One way to directly test the biological roles of pectins, 
and to study their mechanisms of action, is to produce plants with specific alterations in pectin 
structure. .Thisxan be done by knocking out genes that encode the pectin biosynthetic enzymes. 
Such enzymes include the nucleotide-sugar biosynthetic enzymes and the glycosyltransferases 
that synthesis thjs pectic polysaccharides. Each glycosyltransferase is expected to transfer a 
' unique glycosyl residue in a specific linkage onto a specific polymeric/oligomeric acceptor. To 
date, only five^^^^ of the more than 200 predicted wall biosynthetic glycosyltransferases have 
been identified at the gene level, and none of these have been shown to encode pectin 
biosynthetic enzymes. The progress report below describes our identification of the first gene 
(GALATI) encoding a pectin biosynthetic enzymje, GALATl. The identification of this gene 
now allows the characterization of this biosynthetic enzyme, the use of this gene to produce 
mutated enzyme and plants, and studies of the irole of this specific GalAT in homogalacturonan 



and pectin synthesis. For clarity, please note that in this proposal we refer to the game as 
. GALATI, to the protein produced by the gene as iSALATl , and to an al ,4- 
galacturonosyltnmsferase enzyme activity in plant extracts as GalAT. 

Pectin Synthesis 

Based on the known structure of pectin, at least S8 distinct glycosyl-, meth]|9- and acetyl- 
transferases are required to synthesize the family of polymers known as pectin. As dtown in the 
. review^ in Appendix J and in Table I, between 4-9 galacturonbsyltranferases are prenEcted to be . 
required for the synthesis of HGA, RG-I, RG-II and possibly for the synthesis of the modified 
forms of HGA Known as XGA and AG A. Since, as described in the progress report below, the 
GALTAl gene was identified based on the presence of the GALAT 1 protein in Arabidopsis 
partially purified protein fractions that contained UDP-GalA:HGA al ,4GralAT enzyme activity, 
we propose that GALATl is UiDP-GalA:HGA al,4GalAT. Thus, the following review of pectin 
sjoithesis will concentrate on prior research on GalAT.' Additional information on other pectin' 
biosynthetic enzyme activities can be found in the review in Appendix 1^. 

. UDP'GalA:Hornogalacturonan oc- 1,4'Galacturonosyltransf erase (pALATl) 
Membrane-bound al-4galacturon6syltransferase (GalAT) activiiy has been identified and 
partially characterized in mung bean*^"* tomato*^ tumip*\ sycamore*^, tobacco suspensions 
radish roots^j enriched Golgi froiri pea^, -Azuki bean\*. Petunia}^ ^ and Arabidopsis {Sierling and 
Mohnen, unpublished results) (see Table 11). The pea GalAT was localized to the Golgi^ with its 
catalytic site facing the lumenal side of the Golgi, . These results provide the first direct 
enzymatic evidence that the synthesis of HGA occurs in the Golgi. . In in vitro leactionis, GalAT . 
adds [^"^CIGalA from UljP-[*'*C]iGalA^'^ onto endogenous acceptors in microsomal membrane 
preparations to produce radiolabeled products of large molecular mass (i.e. --10S kd ip tobacco ; 
microisomal membranes^ and > 500 kd in pea Golgi^). The cleavage of up to 89% of the 
radiolabeled product into GalA, digalacturonic acid (diGalA) and trigalacturonic acid (triGalA) 
following exhaustive hydrolysis with a purified endppolygalacturonase confimied thai the 
product: ^ *. 



Table n. Compqnsonof apparent catalytic constants and pH optinmni of HGA-al,^ 
galacturonosyltransferases^*^ ■ ' • ' 



Enzyme' 


Rant Source 


Apparent K„, for 
UDP-GalA . 
(UM) 


pH 
optimum 


Vmax 
(pmol mg*'min*^ 


Ref 


GalAT' 


inung bean 


1.7 


6.0 


. . -4700 


id 


GalAT ■ 


mnng bean 


h.d. 


n.d. 


n.d. 


i\ 


GalAT 


pea 


ii.d.* 


6.0 


n.d. 




GalAT 


pea 


■ n.d. 


. n.d. 


n.d. 


"7 


GalAT . 


sycamore 


' 770 


n.d. 


? 




GalAT ■ 


tobacco 


8.9 


7.8 


150 


2 


GalAT (sol)' 


tobacco 


37 


. 6.3-7.8 


290 


3 


GalAT (sol)' 


Petunia 


170 


7.0 


480 


B 


GalAT (per)* 


Azoki be^tn 


140 


6.8-7.8 


2700 


14 



Adapted from 

' Unless, indicated^ all enzymes are measured in particulate preparations 
^ (sol): detergent-solubilized enzyme 



' 5 



* (per): detergent-peimeabilized enasyme 
^ n.d.; hot detennined 

synthesized by tobacco OalAT was largely HGA- Thus, in vitro the crude enzyme catalyzes the 
reaction: UDP-GalAT + HGA(n) HGA(n+l) + UDP. The product produced in vitro in 
tobacco microsomes Wa$ - 50% esterified^ while the product produce in pea Golgi did not appear 
to be heavily esterified'^. These results suggest that the degree of methyl esterification of newly 
synthesized HGA may be species specific and that methylesterification occurs after tBn^ synthesis 
' of at least a short shretch of HGA. GalAT in detergent-permeabillzed tAlcrosomes fiom azuki 
. bean seedlings added (^'^CJGalA from UpP-[^*C]GalA onto acid-soluble i)olysaIlacturoriate 
(PGA) exogenous acceptors^"^. Treatment of the radiolabeled product with a puriffied fungal 
endopolygalacturonase yielded GalA and dlGalA, confirming that the activity identnSed was a 
GalAT comparable to that studied in tobacco and pea. The azuki bean enzyme had a auprisingly 
high specific activity of 1*300-2000 pmol mg*^ riiin"^, especially considering the laige amount 
(3.1-4.1 nmol mg"^ min"^) of polygalacturonase activity that was also present in the mcrosomal 
preparations- As with the product made by tobacco, no evidence for the processive transfer of 
galactosyluronic acid residues onto the acceptor was obtained (see below). 

• GjJAT can be solubilized from membrianes with detergent^. Solubilized GalAT adds 
GalA onto the non-ieducing end'^. oi^ exogenous HGA (oligogalacturonide; OGA) acceptors of a 
degrees of polymerization S ten^. Tlie^bulk of the HGA elongated m vitro. hy solubiliz&d Gal AT 
from tobacco membranes^, or detergent-permeabilized Golgi from pea^, at roughly equimolar 
UDP-GalA:acceptor concentrations, is eloiigated by a single GalA residue. These resaltes suggest 
that solubilized GalAT in vitro acts nonprocessively. (i.e, distributively). The apparent lack of in 
vitro processivity of GalAT was recently confirmed by Akita et ,aL who, using pyrid^ilaminated 
oligogalacturonates as substrates and high concentrations of UDP-GalA, showed diai; although 
OGAs can be elongated in a '^successive" fashion with up to 10 GalA residues by solubilized 
enzyme frpm petunia poUen^^ the kinetics of this response suggest a distributive inode.of action.. 
Our working hypothesis is that for a Golgi-Iocalized enzyme that synthesizes a complex polymer 
in a confined internal cellular compartment with sufficiently high concentrations of substrate, it 
is would- not necessarily be advantageous for the enzyme to act processively. In* fact, the reaction 
velocity could be hindered under such conditions if the enzyme were proces^ive^^. 

The apparent kinetic constants and -pH optimum for. the characterized GalATs aie shown 
in Table II. We have performed additional non-published kinetic studies in tobacco andiadish 

-that suggest that solubilizBd and 
membrane bound GalAT may 
have unusual apparent bqphasic 
kinetics: We tested Vo for radish 
GalAT at 2 fiM to 80 mM UDP- 
GalA and obtained a biphasic 
curve (Fig. 4), suggestitigthat the 
. kinetics of GalAT, at le^et in the 
membrane and soluble ftactions, 
are complex. Comparable results 
were also obtained for the 
solubilized radish and tobacco 
enzyme. The initial Vo vs (UDP- 
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fUDP-QalA] (uM) 



Fig. 4 GalAT kinetics in radish microsomal membranes. Radish mlcfosomal 
membranes (60 - 80 fig protein) were incabated with 70 |ig OGA (DP 7 - 23) andilie 
indicated concentrations of UDP-GalA. Eacli reaction contained a sniali concentra- 
tion of UDP-[^^C]GalA (2 - 3.6 >iM) witli larger amounts of npnradipaclive UDP- 
GalA. The precipitated reaction products were measured by liquid scintillation 
counting. The data are the averages of duplicate samples from three separate 
ex|]rerime)its. The Y axis is specific activity (pmol min** mg*')* 



GalA] curve was hyperbolic and appeared to reach an initial maximum Vo of - SOOpimDl mg'^ 
min'* at -1 mM UDP-GalA, confirming previous results reported for tobacco^'^. Howeyer, at > 2 
mM UDP-GalA there was a second hyperbolic increase in GalAT activity that reached a 
maximum of -2-4 nmol min"' nag"* with -20 mM UDP-GalA. Jn crude enzyme pieparalfens it 
was not possible to determine the basis for the unusual kinetics. One possibility is thatttwo 
GalATs were present, cine with a low Km and one with a high Km. Another possibility is that 
UDP-GalA is both a substrate and an allosteric regulator of GalAT. Alternatively, amose 
"trivial" explanation is that at low substrate concentrations the kinetics of GalAT were efflfccted 
by a catabolic enzyme (e.g. a phosphodiesterase) in the enzyme prep. The identification of the 
GAIATI gene (see progress report) and the production of pure heterologously expressed 
GALATl now allow us to study this further. . . 

PROGRESS REPORT 

Identification of the GAIATI Gene (results from currentUSDA grant) 

The major goal of our current USD A grant, in which we are in the begiunii^ of the . 
second yeai*, was to identify fhe Arabidopsis gene for GalAT by heterologous functional 
expression. As outlined below, we have been successfiil in transiently expressing GALATl 
in niammaliah ceils and identified the first gene for a GalAT. We are currently 
characterizing the expressed GalAT <a third goal of the current funded grant> arid are jn flhe 
process of .making stable, matimialian cell transfpimantS;. Due to our success with the 
heterologous expression, and to the Jack of available T-DNA mutant lines in our identified 
GalAT gene, we have not proceeded with the goal of identifying GalAT mutant plants. We 
postporied the RNAi.studies due to the reduction in the lequesti^d funding of our last grantt (see 
rather Specific Aim 4 in this renewal proposal). . ' 

We used a partial purification-taiidem mass spectrometry approach to identify putative 
QalAT genes from Arabidopsis (see Fig. 5 for schematic of strategy). GalAT fxom ArabMopsis 
was partially purified finom.detergent-solubilized enzyme by sequential passage over two « more 
of the following resins: cation exchange resin SP-Sepharpse, reactive green 19 resin, leadfive 



Arabidopsis cell 
suspensions 



SDS^^AGB analysis 




Comparison of amino 

GalAT enzyme 'a^^-^ ► Trypslnize proteins in ^ Tandem MS of ^ add sequence a^unst 

most active fractions* generated peptides Arabldopos genome 

database 

Figure's* OuAine of strategy to identify the gene for GalAT, The sequenced Arabidopsis genome allowed the use of a 
functions-based partial puriflcation*mass spectrometry approach cp identify the putative carbohydrate biosynihctic 
genesi 1: homogenatei 2: total membranes, 3: solubiti^d proteins, 4: initial anion exchange purification step. S: 
additional chromatography (not shown). 



t 

blue 72 resin, reactive yellow 3 resin, and UDP-agarose. Proteins obtained from seled&oii 
fractions from these, columns were treated wit]h trypsin to generate peptides, and the mam acid 

. sequence of the peptides identified by liquid chromatography-tandem mass spectromeficy.The 
amino sequence was used to screen ih& Ara(>idopsis gCTe/proteih database. Thirty uni^iae proteins 
were solely identified in the GalAT-containing fractions (i.e. not present in fractions not 
containing Gal AT activity). Among the 30 unique proteins that co-purified with GalATactivity, 

• two .proteins (designated JS33 and JS36) were identified as Arabidopsis putative GalAT 
proteins/genes based on their having, at least one predicted transmembrane domain and mice they 
contained a predicted glycosyltransferase dpmain (see CAZy database; http://afmb.cni5r' 
mrs.fr/CAZY/index.html^ , / . 

' These two genes, along with another Ara^i^fopsir gene with high sequence simSasity to 
JS36 (designated JS36L for JS36-like) (see below) were either cloned by RT-PCR (JS36and 
JS36L) using mRNA from Arabidopsis flower and stesm tissue (gift of Maor Bar-Peled„ OCRC), 
or a cDNA.clone was obtained from the Arabidopsis Biological Resource Center (JS33>. The 
proteins encoded by these genes each have a predicted single transmembrane domain CTaUe HQ. 
The genes were truncated to remove their N-terminal region including all or most of the * 
predicted transmembrane domain (see Table in), and the truncated genes were inserted inito the 
mammalian expression vector pEAK (gift of Kelley M(Mreinen lab,- CC3RC). 



Table ni. Predicted characteristics ofJS36, JS33 and JS36L proteins. Prediction:; made usin^ 
information from the NCBI database and the SOSUI (Classic & Membrane Prediction prpgram>2([BCM 
SejEurch Launcher site. (http://searcMaunchenbcm:.tmc.edu/seq-^^^ " . 



Gene 


NCBIprotein 


•#. 


MW 


pi 


PredictjHl 


TraBcated 




bdI . 


amino 


(kd) 




transmem- 


protein 






acids 






brane domain 




At3g<SH30 (JS36) 


MP 191672 


673 


77.4 


9.95 


N22-44^' : 


"^42-^73^ 


At2g38650 (JS33) 


NP_565893 


.61S 


69.7 


8.63 


'"23-45'-' 




At5g47780 


NP^568688 


616 


71.1 


9.26 


"6-22^ 


"26416'' 


(JS36-Uke) 















The trancated forms of JS33, JS36 and JS36L in a pEAK vector (Edge BioSystems as 
inodified by Moremen lab, CCRC) contaiiiing a heterologous signal sequence, a histidme OHZS) 
tajg, and two influenza hemagglutenin (HA) .'epitopes, were transiently expressed in kuman 
embryonic kidney, cells (HEk293 cells) for 46 hours. The media Was collected. . Since the 
translationsd fusion proteins cbnstmcted contained two copies of the HA epitope, tbe media 
surrounding the transiently infected cells was incubated with a mouse anti-HA IgGl that was 
bound to Protein A Sepharose and the immuhoabsorbed protein was assayed for QalAT activity. 
Table IV-A shows that the JS36 construct expressed GalAT activity. The experiment was 
' repeated using Protein G Sepharose (Table IV-B) as an immunoabsorbent for the mouse anti-HA 
IgGl (since Protein G Sepharose is reported to bind IgGl more effectiyely than Protein A 
.Sepharose). The results show that JS36 again expressed GalAT activity, although the activity 
was lower, suggesting that the enzyme was losing activity over time.'. From these results of the 
transiendy expressed protein we conclude that JS36 is a GalAT and have named the gene 
GALATl in ' accordance with the rules put forward by the "Corhmission on Plant Gene 
Nomeqclatiire • fhttp://genome-www.stanford,edu/MendelA and The Arabidopsis Infomiation. 
Resource (TAIR) rhttp://www.a rabidODsis.org/noniencLhtml: 

http://www/arabidopsis. org/info/guideIines.html#^enesV ' 
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Table IV. Anti-HA epilope:protein A Sepharose immunoabsorbed protcdn from media 
surrounding man^malian HEK293 cells transiently expressing recombinant putative GalAX. 

Truncated JS36, JS33 and JS36L genes inserted into the mammalian expression vector pEAK, altong with 
a pEAK vector only control, were independently transfected into human embryonic kidney cells 
(HEK293 cells). The transfected cells we^ grown for 46 hours to transiently express the recomMaant 
proteins and the media snrrounding the cells was incubated with anti-HA IgGl bound to either RnDtein A 
Sepharose (A) or Protein G Sepharose (B). The inmiuQoabsorbed protein was assayed for GalAT activity 
using UDP-['/^C]GalA and oligogalacturonides as substrates. The data are the average [^''CIGalA 
incorporated mto product (minus TO controls) from duplicate 45 min (A) or triplicate 60 min G^) 
reactions. Time 0 controls are reactions to which base was added to inactivate the enzyme prior tB» the 
addition of substrates. The GalAT activity in solubilized Arabidopsis protein was assayed as a positive 
control. , ; . ' ' . 



Enz^e 


(A) 

[ QGalA incorporated 


(B) 

(''*C]6aIA incorporated 


Recombinant truncated JS36 


161 


H2 


Recombinant truncated JS33 


30 


.0 


Recombinant truncated JS3GL 


0 


0 . 


Vector control . 


0 


4 


Solubilized Arabidopsis enzynie(control) 


"1434 


550 



As nientioned above, analysis of the gene sequence of GALATl shows that the predicted 
expressed protein contains one transmembrane domain. This is in agreement with the GalAT . 
activity being membrane bound in all species tested (see appendix^). Furthermore, the predicted . 
topology of GALAT 1 is that of a type-II membrane protein, in agreenjent with our jprevious - 
determination, that the catalytic site of pea GalAT lies in the lumen of the Golgi, Type-II . 
membrane proteins have a short N-termihal cytosolic tail, a transmembrane region, a stem legion, 
and a C-terminal catalytic domain^^. 

GALATl is a member of the Glycosyltransferase Family 8 in the CAZy database 
[database of putative and proven carbohydrate modifying enzymes that currently contains 61 
different proposed glycosyltransferase families (http://afmb.cnrs-mrs.fr/CAZY/index.htnil)^'^]. 
The presence of GALATl in Family 8 is in agreement with our demonstrated activity of _ 
GALATl as an al,4-gaIacturonosyltransferase, isince Family 8 is a family of proposed retailing 
glycosyltransferases and GALATl is a retaining enzyme (i.e- the'a-configuration in the substrate 
UDP-a-GalA is retained in the product al ,4-linked-galacturononan (HGA). 

Of the 194 putative glycosyltransferase proteins in Family 8, only two have detemnined 
.3P crystal structures: an al,4-galacto$yltransferase from Neisseria meningitidis and glycogpnin. 
A'pairwise blast shows that GALAT has no sequence similarity with glycogenin but does have 
sequence similarity. with argalactos yl transferase LgtC (22% identity and 43% similarity over a 
1 89 amino acid region). The^ Neisseria meningitidis GalT LgtC adds a-galactose. from UDP-Gal 

. to a terminal lactose acceptor in a bacterial lipooligosaccharide acceptor to form an al,4-linked 
galactose. Thus, the nucleotide-sugar substrates used by these two enzymes differ only in the 
UDP-Gal having a hydroxyl rather than a carboxylate at C6. The acceptors, however, are 

. structurajily different. 

. * Interestingly, when we used the GALATl amino acid sequencb to search for predicted 
prptein structure using the GenTHREADER fold prediction program%n the PSBPRED protein 
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structure prediction server^^ fbttp:/A?ioinf.cs,uc].ac.uk/psipred/^, the only crystallized protein to 
shown significant similarity was PPB entry 1GA8 which was the family 8 NeisseriaGaiTTLgtC^^ 
mentioned above. GALATl is predicted to fold very similarly to the GalT LgtC. Thr large N- 
terminal domain of GalT LgtC (which is similar to the C-terminal domain of GALATl (ixl the 
presumed catalytic domain) has a rhixed oc/p fold. Based on our comparison of the oonsenfed 
amino acid sequences between the two proteins and the identity of specific amino adds that 
interact with the nucleotide-sugar and acceptor a^ determined by the crystal structuie of Gairr 
LgtC in complex with aXJDP-Gal ?mal6g and the acceptor, we propose that in GALAT the amino 
acids D498, D500 and H635 are involved in complexing with Mn^; while D499, G638, KB4.1 
are involved in binding to UDP; and R48 1 is involved in Innding Gal A in the acceptxar. Urns, 
these amino acids will be among the first targeted for site directed mutagenesis (see IzelowX It is 
also interesting to note that GalT LgtC binding of UDP-C5al appears to induce a conformational 
change in the protein, suggested induced fit. The nucleotide-sugar binds in a deep, solvent 
shielded cleft in GalT LgiC such that the UDP-Gal is almost entirely buried within tbeen^me, 
possibly explaining the requirement for the nucleotide-sugar in order to crystallize GaKT LgjC^^. 



U: ;:-JS33 ; - JS36 ■. JS36L' 



Fig: fiA UT-PCR OFJS33. JS36 am} JS3L using. 
' Arabltlopsis flower (F)» (R). Stem (S) and 
:icaf CL)nNA. • ■ y -\ ■<:'':^V 



^ F U S L : 
LI ■ W 5^1 JS 1.^ 



Fig.rtB RT-PCRc6nlrol;RT-PCRof. 
AriibUopsis actin geno miu^Arabi'dopsis Houer 
<.Fj» niDt (R), s(cm (S) and leaf (L) RNA. 



GALAT is expressed in multiple Arabidopsis tissues at miiltiple times during 
development. We base this on our RT-PCR analysis of RNA from Arabidopsis flower, root, 
stem and leaf tissue (Fig. 6) showing that GALAT is expressed in all these tissues, and based on 
the 18 EST entries for this gene in the TAIR.database (http:y/w ww.arabidopsis.org/) indicatmg 
that GALATl is expressed in developing seed, green siliques, roots and above ground organs. 

Identification of a proposed GALATl Gene Family * " 

A standard protein blast and a PSI Blast of the NCBI protein database using the GALATl 
(JS36) amino acid sequence reveals that GALATl is a member of a 15 member putative GalAT 
gene family in Arabidopsis. The genes we selected for this family, had > 30% amino acid identity* 
and > 50% amino .acid sinoilarity based on a PSl Blast. We further compared these genes along 
their entire coding sequences with JS36 using a Pairwise BLAST (Table VI) showing that this 
family of genes has > 34% identity and > 52% to JS36 in the portion of the genes C-tcnninal to 
the membrane spanning domain. This identity is comparable to the 37-54% identity shared 
among the proposed ten memhoT Arabidopsis fucosyltransferase gene family (AtFUl-10)^^ 

Of the members of the putative Gal AT gene family, only GALATl has proven function. 
However, it is interesting that recentiy a mutant named Quasimodo was identified^^ that has 53% 
aminp acid identity and 72% sinailarity to GALATL Thus, the gene affected in Quasimodo is a 
member of our proposed putative GalAT family. Quasimodo has 25% reduced levels of GalA in 
the cell wall and reduced cell adhesion, characteristics consistent with a GalAT. However, ther^ 
is not yet enzyine confirmation of the activity of the Quasimodo gene. 
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The conserved amino acids in our proposed putative GalAT gene family are shown In 
Fig. 7. GlycosVl transferases are expected to contain one or more carboxylates at the catalytSs:' 
site. At least one of the carlpoxylates is expected to coordinate a divalent cation associated with 
the nucleotide-sugaTi In many glycosyltransferases the metal coordination involves two* 
carboxylates that are often present as DDx, xDD, or DDD (the so-called **D(x) P" mQ&f\ We . 

predict. 



550. 



4 

. p-i-CL-L . 



t 
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]^gure 7. Sehemcuie representation of transmembrane, spanning region and conserved amino 
acids in the Arabidopsis thaliana putative QqlAT gene family. The relative position of strictly 
conserved residues among the members of the putative GalAT family are numbered as for JS36 
(Le^GALATl).' The white speckled region from residues 22-44 represents the predicted 
transmembrane region. 
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within the putative GalAT gene family, that D498 and D500 are part of the •T)(x) D* nwtif. 
However, additional structural and mutational data is required to confirm this. 

Although it is predicted that retaining glycosyltransferases have a covalent medEanisnD for 
.catalysis, in wlidch the glycosyl residue is covalently transferred to the enzyfne from the 
nucleotide-sugar prior to transfer to the oligo/polysaccharide acceptor^^, recent structuntf analysis 
of bovine (a-l,3-GalT) opens the possibility thati at least for bovine a- 1 ,3-GalT, a sequential 
ordered binding mechanism may occur^"*. Clearly, additional stmctural data from other retaiuing 
glycosyltransferases is needed to determine the range of enzyme mechanisms used by idsainiiig 
glycosyltransferases. If time allows, part of the proposed research is aimed at obtaining x-ray 
ciystallogrkphic and NMR 3D structural data on AraWcip^ 

Table VI. Pairwise sequence alignment between. JS36 and the other members of proposed GalAT gene 
family. The alignment was done using the NCBI Pairwise BLAST and Matrix. BIosiim62, The % anmo 
acid identity and similarity are shown. In all cases the alignment compares the buUc of the C-tesminal 



Gene- 


NCBi protein U) 


% 


%Similarity 


**t3a6ll30 (GALATl: JS3S\ . 


NP 191672 


100% 


100% • 


At5e47780 rJS36.1ike1 


NP 568688 


63% 


81% 


At2p46480 


NP 182171 


61% 




At4ff38270 


NP 195540 


55% 


73% 


At3fi'25140 (ntmsimodo\ 


NP 18^150 


53% • 


72% 


AtlRl8580 


AAK93644 


48% 


67% 


At3B02350 


NP 566170 


47% 


66% ■ 


A12b20810 


NP 565485 


46% 


68% 


Atlff06780 


NP 563771 


44% 


. 63% 


At2R30575 


NP 671901 


43% 


65% 


At3e01040 


NP 186753 


42% 


61% . 


At5ffl5470 


NP 197051 


42% 


61% 


At5ff54690 


NP 200280 


38% 


60% • 


.At2ff38650 f T5?33V 


NP 565893: 


36% 


60% 


At3R58790 


NP 191438 . 


34% 


52% 
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RATIONALE AND SIGNraCANOE 

Our identification of the GALATl gene in transiently transfected mammalian. ceUb- now 
allows the production of stably transforaied cell lines that produce GALATl and experiments 
aimed at characterizing the mechanism of the enzyme and at determining the role of GALATl jai 
pectin synthesis. The successful completion of the proposed research will provide the fiisH 
biochemical characterization of a GalAT from any species. Specifically, the substrate spccificitir 
Of GALATl will indicate whether it catalyzes only HGA synthesis, or also plays a role ioRG-I 
and R.G-II synthesis. Characterization of the kinetics of GALATl will clarify whether 6nnot 
UDP-GalA is both a substrate and an alios teric regulator of the enzyme. Characterization of the 
mutated GALAl enzyme will provide information regarding amino acids important in caflalysis 
and substrate binding. The subcellular location of GALATl will provide the first framewBirk for 
where, within thfe Golgi and plant endomembrane system the complex series of pectin 
biosynthetic reactions occur. If GALATl knockouts with a phenotype are obtained, the RNAi 
experiments will give inforination regarding the rol^ of GALATl in pectin synthesis/ potntially 
provide novel biosynthesis acceptors, and provide infonnation about the role of pectin in plant 
growth and development. If time allows, this biosynthesis framework will be extended by the 
identificatioil of GALATl binding proteins that would be putative pectin biosynthesis conoplex 
members. The successful completidn of these studies will set the foundation for our futuro goals 
of a full in vitro reconstitution of functional pectin synthesis complexes. The proposed research 
will also compliment other research in the lab regarding the potential developmental and/cr 
tislsue specific expression of the GALATl gene,' at determining the in vivo role of GALTAB via 
production of plants expressing altered level of GALATl transcript, and at det^inining ths 
function of the other members of the proposed. GALATl family. . 

GALATl has high sequence similarity to M othe^rArabidopsis proteins (the proposed 
putative GalAT family) and to proteins expressed in other plants. Possible GALATl honwlogs 
in other plants are a 68 kd protein expressed in Cicer aneftViMm (chickpea) epicotyls (76% amino 
acid identity; 87% similarity), a hypothetical protein from Oryzasativa (jappnica) (59% ideixti^ 
75% similarity) and a protein froin Populus alba (49% identity; 72% similarity). Thus, the results 
from the study of GALATl in Arabidopsis will likely extend to other plants. Including those of 
high agricultural value. . 

EXPERIMENTAL Pli AN 

Specific Aim.l. Stable expression of GalATl in a heterologous expression 

system a&d detailed characterization of the enzyme. 

• . • \ • 

Heterologous expression of GALATl * 

111 the progress report, we described results in which media from human embjcyonic 
Jddney (HEK293) cells transiently infected with recombinant tnincated GALATl expressed 
GALAT. Whereas transient expression allowed the expression of sufficient GALAT to measure 
GalAT activity, additional expression strategies are required to produce large quantities of 
GALATl required for further characterization of the enzyme and for antibody production. Since 
the transiently expressed N-terminal epitope-tagged GALATl expressed in nianunalian cells was 
active, one strategy will be to produce stable, transfected clonal HEK293 lines^^ using the 
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described recombinant truncated GALATl. The alternative strategy will be to express thefiill 
length and N-terminal truncated forms of GALATl in the fungal expression system Pichm 
pastoris. These $y.stems weie"cho$en since we and others^^^^ have successfully used thenD to 
express plant glycosyltransferases. $ince it is not possible to predict, a priori, which of the 
systems will allow us to produce high levels of functional GALATl, we will initially mate 
constructs in both systems. 

Kelley Moremen, a faculty at the CCRC routinely expresses proteins in^inan[unalkn cells 
and lias advised us on the transient expression of GALATl in mammalian HEK293 cells. He 
will also advise us on the production and cloning of stable tranfected cell lines (see letter of 
support). The basic protocol will be as in the Edge BioSystenis protocols with the modification 
suggested by the Moremen lab.* • 

.• For expression in P. pasforis^ cDNA encoding the entire, and the truncated soluble forms 
of, Gal AT will be gen^ated by PGR using gene/vector specific primers. The PGR producOs will 
be subcloned into appropriate Pichia expression vectors (Invitrogen)in which the cDNA is 
inserted downstream from an alcohol oxidase (AOXl) promoter. We are in the process of 
making full length coding sequence constructs for expression in the Pichia vector pPIG 35L This 
vector does not contain an epitope tag. We will also make epitope tagged GALATl constnacts in 
the Pichia vectors pPICz and pPICza (Invitrogen) and determine whether functional C-tenminal 
epjtppe-tagged constructs th.at do not effect GalAT activity can be recovered. We have alisady 
cloned the fiiU coding sequence of GALATl into the plasmid vector pCR2. V and are preseartly 
inserting the gene into the pPIC3.5 vector. Pichia will be transformed with the linearized 
recombinant construct; Abundant expression of the recombinant protein will be induced bjc 
growth in methanol. Cells will be plated and screened for growth under selection using ah 
appropriate construct/selection scheme. The selected colonies .will be assayed for GalAT 
activity. Approximately 10-20 GalAT positive colonies will be quantitatively assayed using the 
•radioactive filter GalAT assay^^. .. Advantages of thc Pijchia expression system include hi^ 
levels of pjrotein expression (up to 12 gm L*'), the relative low cost of expression in Pichia 
compared to expression in mannonalian systems, and the demonstrated success of Pichia for 
'^xpressipn of glycosylfcransferases^^^^ Several labs at the CCRC are also currently using the 
Pichia system^^"^^ and have successfiilly expressed plant enzymes. Once a high-GALATl- 
producing line is recover^, production of large amounts of protein will be carried out in 
fennentors^ or spinner flasks (available at the CCRC and at the University of Georgia 
Fermentation Research Facility). ... 

Characterization of Expressed GALATl 

To begin to address how HGA is synthesized, , the kinetics, substrate specificity, and 
stnicture of the purified' and recombinantly expressed GALATl will, be detemiiined and 
compared to the . solubilized membrane-bound Arabidopsis GALAT purified by 
immunoabsorption using the polyclonal-*antiGALAT} (see below).- Although the characteristics 
of GALATl are consistent with the enzyme being the/a catalytic subunit of the HGA synthase, 
GALATl could be a GalAT involved in RG-E or RG-I synthesis. For example, GalAT could 
represent an RG-I:GalAT that initially elongates HGA by a single GalA and then waits for a 
required NPP-Rha to start RG-I backbone synthesis. The kinetics of purified and recombinantly 
expressed GALATl for UDP-GalA and a size range of hpmogalacturonan and pectin acceptors 
• will be determined. The. effect of other nucleotide-sugars and oligosaccharide substrates on 
GalAT will aI$o tested to identify activators and inhibitors. Our results with tobacco GalAT 
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indicate that UDP,.UDP-GlcA. UDP-Gal, and UDPrP-GalA are not substrates but rathor inhibit 
GalAT. We will confinn these observations with the recombinantly expressed GALATl and 
also test UDP-Xyl (substrate for xylogalacturonan) and UDP-Rha^ (substrate finr RG-I 
backbone) (gift of Maor Bar-Peled, CCRC). 

Th& expressed full length and truncated enzymes will be assayed in reaction buffer in the 
presence, and absence, respectively, of Triton X-100. The kinetics of the enzynie for UMP-GalA 
will be carried out in a total of 1 iiM to 80 mM UDP-GalA + UDP-C^'^CJGalA. We routinely 
synthesize JJDP-[^'*C]GalA either by the 4-epiinerizati6n of UDP-[^'*C]GlcA* or oxidation of 
UDP-(^'^C]Gal^'* since UDP-E^'^CiGalA is not. comniercially available. The effect of different 
acceptors on GALATl activity will be cohductedl )using 100 pM UDP-GalA and O.'l-iOO fig 
acceptor/ 30 jil reaction. The acceptors that will be tested include HGA oligosaccharides 
(oligogalacturonides) of degrees of polymerization ranging from 2-16, pplygalacturoBic acid, 
commercially available citrus pectin of -30, 60 and 90% esterification, RG-I and RG-IL The 
products made using the different acceptors will be characterized^. If RG-I is shown to serve as 
an acceptor, RG-I backbone fragments that have a GalA or a Rha at the non-reducing emsL (gifts 
of Tad^hi Ishii and Vons Foragen) will be used to determine acceptor specificity. The acceptors 
will be tested using both the precipitation assay^ and the filter assay^^. The enzymes will also be 
characterized for.pH optimum, temperature sensitivity, and effect of reducing agents, iSvalent. 
cations and salts. Finally, we have frequently observed an exopolygalacturonase activity in some 
of our most purified. Gal AT preparations from Arabidopsis, Exopolygalacturonase has also been 
reported in Gal AT preparations from petunia*^ and Azuki bean*"*. We will assay expressed 
GALATl using a nonradioactive GalAT-PAGE assay*^ to deteraiine if the expreissed enzyme has 
any exopolygalacturonase activity. 

Characteristics of the recombinant truncated GALATl , will be compared to the GAIATl 
solubilized from Arabidopsis membranes by immunoabsorption of the solubilized GALATTL 
using anti-GALATl antibody (see section below) bound to Protein A or G Sepharose, or Iqr 
coupling the anti-GALATl antibodies to3M-Emphaze resin^ and using the resin used to purify 
GALATl from SQ\yxh\lyzQd Arabidopsis enzyme. If the characteristics of the immunoabsca&ed 
Arabidopsis GALATl are different from the recombinant truncated GALATl , we will .anaByze 
the immunoabsorbed GALATl by LG tandem mass spectrometry to determine if additionaS 
proteins were immunoabsorbed with iho Arabidopsis solubilized GALATl that may have 
modified the activity (e.g. a heteromeric cornjplex). 

The recombinant GALATl and the GALATl inununoabsoibed-from Arabidopsis 
solubilized mernbranes wU also by treated with iV-glycanase to determine if they are 7V- 
glycbsylated. To determine if they are O-glycosylated, the proteins will be exhaustiyely.treated 
with JV-glycanase, the released oligosaccharides removed by passage of the treated protein over a 
CI 8 SEP-PAK cartridge, and the resulting protein analyzed by TMS methylation analysis to 
detemdne the glycosyl residue composition of any carbohydrates still attached to the protein. 
Any oligosaccharide released by the N-glycanase treatment would alsQ be analysis by TMS 
methylation. The results of these experiments would indicate whether the mXiye Arabidopsis 
GalAT is glycosylated .tod whether the recombinant fonns have the same or different 
glycosylation pattern. Changes in glycosylation could effect GALATl enzjrme activity and/or 
substrate bindings GALATl is predicted to have 5 or 6 iV-glycosylation sites (NetNGlyc 1.0 
Prediction; http://wvvw.expasy.org/sitemap.html). 
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As mentioned above, we have found that membrane-bound and solubilized Gal AT 
activity in tobacpo and radish has unusual apparent biphasic kinetics. Thus, we are particulsirly 
. interested in determining if the expressed GALATl shows the same kinetics, including possible 
allosteric regulation by UDP-GalA. Such results would indicate likely 4** structure. WewflB-test 
for possible multimeric structure by determining the mass of the enzyme by size exclusion 
chromatography and comparing these with the mass obtained by SDS-PAGE. The possibilifiy 
that GALATl exists as a heteromultimer will be tested by mixing expressed GALATl with 
solubilized Arabidopsis enzymes and immunoabsorbing GALATl and proteins bound to it rasing, 
either ah anti-GALATl antibody or an anti-HA epitope antibody (see previous section). 

Specific Aim 2. Production of a series of mutated GalATl proteins by site- 
directed mutagenesis of the GALATl gene and heterologous expression of the 
mutated proteins. 

The long term goal* of the research proposed in this section is to determine the role, m 
GalATi, of the 45 strictly conserved amino acids among the 15 members of the putative GaDAT 
family. However, in the jpresent proposal we will initially target, by site-directed mutageneas,. 6- 
10 amino acids likely involved in substrate/acceptor binding and/or catalysis. We will proceed to 
the other amino acids as time allows. The strategy will be to systematically mutate selected 
residues in GALATl to determine their roles in GALATl activity. The effect of these mutattfons 
on GALATl specific activity, and where warranted, on Km, Vmax,.and acceptor specificity ^.e. • 
OGA, RG-I and RG-IO and product size (i.e. enzyme processivity) will be determined. As noted 
in the introduction, we already propose a function for some of the amino acid residues based on 
analogy to their functions in &e GalT LgtC (the most similar glyco^Itransferase for which aiSD 
structure with substrates bound is available). We propose that D498, D500 and H635 are 
involved in complexing with Mn**"; D499, G638, K641 are involved in binding to UDP; and 
R481 is involved in binding GalA iri the acceptor. Like many glycosyltrahsferases. Gal AT 
requires Mn**^^'^. Based on the GalT LgtC structure, we propose that Mh*^ is coordinated by tfie 
two phosphates oxygens of the UDP as well as by side chain oxygen from a D498, two siderfiain 
oxygens, from a D500 and the ring hydrogen of H635. The structurally conserved amino acicfe 
between GfALATl- and GalT LgtC are all invariant in the 15 members of the {Proposed GralAT' 
family except for D449 (invariant in 13 out of 15 ge;ies) and R481 (invariant in 14 out of 15 
genes). We propose' that these residues are involved in catalysis and/or binding to UDP-GalA and 
HGA. It is also possible that some of the residues are required for structural integrity of GALAT " 
or for binding of allosteric effectors. The mutations we propose to test initially will include: 
D498N, D500N, H635A, D499H (two of the family members have a His at this site), K641A and 
R481S. We will also mutate D563N, L564G, W567A, K:568Q and K569Q since we hypothesize - 
that these Mxino acids may be involved in.catalysis or acceptor binding based on their 
gonservation and/or similarity within the proposed putative GalAT gene family- 
Site directed inutagenesis of trundated, and possibly full length (if we can get the full 
length GALATl expressed in a functional form) GALATl will be at a specific at amino. acid(s) 
using the Pro'mega GeneEditor in vitro site-directed mutagenesis system. The GeneEditor system . 
has the advantage over the Altered Sites II kit in that iany vector containing the wild type TEM-1 
beta lactamase gene (i.e. ampicillin resistance gene) can be used. Most cloning vectbrs, 
including those we are using for expression in mammalian and Pichia cells, have this antibiotic 



15 



lesistance gene. The mutant oligonucleotides to be used will include single base substitutions 
but eventually may also include multiple nucleotide substitutions or deletions. The Promega 
GeneEditor system can be used for muldple substitutions, ddetibns and insertions. 

Specific Aim 3. Establishment of the subcellular location of GALATl using 
anti-GALATl antibodies and GFP fusion proteins. 

Production and use of antibodies 

Anti-Gal AT antibodies are necessary for the proposed immunocytochemistiy 
experiments, to inmiuopurify solubilized GALATl from Arabidopsis^ and to select proteins tbat 
potentially bind to GALATl and may function in pectin biosynthedc enzyme complexes. The 
goal is to generate antibodies against the heterologously expressed truncated GALATl, although 
if we find that the fulMengdi GALATl gives significantly more or greater synthetic activity, w 
may make antibodies against the full length forms as well. * In theoiy one can make either 
polyclonal or monoclonal antibodies. However, polyclonals have the advantage that one can * 
expect multiple antibodies with multiple points of interaction against GALATl. Such antibodies 
would be useful for a range of types of experiments, including the subcellular 
• immunocytochemistry, immunoprecipitation/absorption, and enzyme activity inhibition studiesi. 
Thus, our initial stratjsgy will be to generate polyclonal antibodies. A secondary strategy wouM 
be tcy generate monocldnal antibodies if the* polyclonals do not funcdon for the proposed 
experiments. . . * 

Polyclonal antibodies will be generated in two ways to enh^ce the likelihood of 
obuuning useful GALATl specific antibodies. Our main strategy will be to use purified 
lecombinantly expressed GALATl directly to generate rabbit polyclonal antibodies.- While this ' 
should supply polyclonal antibodies that will recognize GALATl, the possibility exists that, dsDs 
to the sequence similarity of GALATl with other putative GaJATs in our proposed putative 
GalAT gene family, the antibodies may also cross react with other members of the proposed gf^e 
family. Thus, we will also immunize rabbits with an eight-branched, multiple antigenic protesoi 
(MAP)^^ synthesized based on anN-terminal less- well conserved re^on of GALATl (i.e. a 14- 
15 non-conserved amino acid region N-teraiinal to the conserved C-temiinal region). Colleagues 
at the CCRC have successfully used this approach to inake anti-CochlioboIus specific anti- 
endopolygalacturonase antibodies*^ While the strength of the MAP system is its ability to 
generate specific anti-peptide antibodies,, the ability of the MAP generated antibodies to cross- 
react with the entire GALATl protein may be more limited. .iThus, itmay be necessary to make 
multiple MAPs based on multiple non-conserved regions. The MAP will be synthesized at the. 
Molecular Genetics Instrumentation Facility- University of Georgia, using the 8-branch Fmoc 
resin as described^^ The polyclonal aoitibodies will be raised at the Um^ . • 

Polyclonal Antibody Production Service using GALATl that we purify and provide. 
Monoclonal antibodies may be generated in mice and/or rabbits as a secondary strategy if 
necessary^'. Dr. Michael Hahn has extensive experience in the generation and screening of 
antibodies and has agreed to advise us (see attached letter). The generation of murine 
hybridomas will be carried out at the Monoclonal Antibody Facility located in the School of 
Veterinary Medicine at the University of Georgia. 

Polyclonal serum'or hybridomas that secret- antibodies against GalAT will be identified 
using a two-step procedure. ITie initial screen will be an enzyme-linked inmiunosorbent assay^ 
and western blotting agsunst puriSed recombinant GALATL The specificity of the antibodies 
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will be demonstrated by Westem-blot analysis of expressed GALATl and of solubilized proteins 
from Arabidopsis cell suspensions. Positive anti-GALATl antibodies will also be subjected to 
second round screening of the serum or hydridoma supematants with GALATl followed by 
iramunoabsorption using Protein A-Sepharose^^-^°. Anti-GalAT antibodies will be identified by 
a loss of GALATl protein and GalAT activity in the supernatant and an elevation of GALATl • 
protein .and GalAT activity in the pellet compared to preimmune controls. If monoclonal 
antibodies are used, anti-GalAT-producing hybridomas will be subcloned and antibodies 
produced in ascites culture.. The antibodies will be purified on protein G-Sepharose, and 
isptyped using a mouse monoclonal antibody isotyping kit (Sigma). ^ 

Subcellular localizatio n of GALATl 

As mentioned in the introduction, all available data, including the localization of die 
catalytic domain of GalAT in the Golgi lumen, Suggest ttiat pectin is qmthesized in the Golgi andl 
transferred via vesicles to the walL However, what is not known is hov/ the different 
glycosyltransferases function to make specific pectin structures. Our expectation is that differentt 
glycosyltransferases are localized in a sequential manner to different cistemae of the Golgi^^Vim 
an order indicative of the order in which pectin is syntiiesized as it moves from the cis, through 
the medial and to the trans Golgi. Evidence from both animal^^ and plants^"^ suggests that, 
either individually or in combination, the transmembrane donaain (i.e. the bilayer thickness 
model^^), the N- or C-terminal sequenced flanking the tranimembrane domain, and/or the 
lumenal domain (i.e. the 'kin recognition n^odel'^^) contribute to localization of proteins within 
the Golgi system. We propose to determine tfie subcellular localization of GALATl within the 
Golgi by immunocytochemistiy in order to provide additional information pn role of GALATl in 
pectin synthesis. For example, a location of GALATl in the cis and medial Golgi cistemae 
would be consistent with a function of GALATl in HGA synthesis, while a localization • 
primarily in the late medial or trans Golgi would be more suggestive of a role m RG-I or possible 
RG-ir synthesis. It should be noted that such subcompartment localization studies, while 
important and novel for die pectin biosynthetic enzymes, are also novel in any species, since "the 
precise location of only a small number of the glycbsyltransferase proteins witiiin the Golgi 
. apparatus have been determined**^. 

Anti-GALATl antibodies will be Used to identify where" in the Golgi GALATl is , 
localized. Both developing Arabidopsis seedlings, and growing suspension cultures will be used 
as a tissue source since botti.tissues include cells actively making wall and since Michael Hahn 
and Glenn Freshour at the CCRC & Plant Biology Department have already established 
procedures to obtain excellent immunocytochemistry results from these tissues. The tissues will 
be prepared in two ways. For chemical fixation, four day old Arabidopsis seedlings with roots, 
or exponentially growing Arabidopsis suspension cultures will be fixed with.potassium 
phosphate buffer (KPB) containing 2.5% (v/v) glutaraldehyde, the root tissue or suspension cells 
removed and washed witii KPB, put through a graded etiianol dehydration series, and infiltrated 
witii LR White embedding resin as described'^. Alternatively, tissue will be prepared by hijgh- 
pressure freezing and freeze substitution^^'^^. Cells or seedling root tissue will be suspended in 
25% dextran (a cryoprotectant) and concentrated^^'^^ The cells will be transferred to staple 
holders pretreated with lecithin/chloroform, allowed to dry, frozen under high pressure using a 
Balzeis HPMOlO High Pressure Freezer (available at the University of Georgia Electrpn 
Microscopy facility. Department of Plant Biology) and stored in liquid nitrogen until use. 
S2umpJ« will be freeze-substituted by transfer to precooled cryo-substitution vials containing 2% 
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osmium tetroxide and 8% dimetho;gqpropane in anhydrous acetone with substitution for 2-3 days 
at-80**C in an acetone dry ice bath. The samples will be warmed slowly, the substitution media 
discarded, the samples rinsed with acetone, aixd the tissue infiltrated with LR White resin^^'^^. 

Thin sections (<100 nm) will be' cut using 'an MT eOOO-XL ultramicrotome, the sections 
collected on Formvar-coated, gilded copper slot grids and the dried sections immunolabeled' . 
following hydration as described^. The sections will be blocked for non-specific binding by' 
incubation in 3% (w/v) nonfat dried milk in KPBS and the sections incubated in the primary 
GALATl antiserum diluted in KPBS or with a pre-immune control antibody. The sections will 
be rinsed with KPBS and labeled by incubation on droplets of secondary anti-rabbit IgG 
conjugated to 15-nm colloidal gold^. After immunoiabelling the secdons will be poststained 
widi 4% uranylacetate apd then lead citrate^^ and examined with a Zeiss EM 902A Electron 
microscope. 

We will use the morphology definitions of Zhang and Staehelin to distinguish the cis 
from the medial side of the Golgi^^ However, the cis versus the trans end of the Golgi stacks 
will also be distinguished by inununolabeling some sections using and-clathrin antibodies that 
label preferentially the trans-Golgi network^'. Different sized gold particles will be used to 
distinguish the different antibodies. Michael Hahn h^s extensive experience in the use of 
antibodies for iiiununocytochemistiy and has. agreed to advise us (see attached letter). 

Specific Aim' 4. Use RNAi to generate GALATl knpckopts and characterize 
the phenotype of the mutated plants- . 

Double-stranded RNA-mediated interference (RNAi) is a method to study the function of 
genes in plants^°^» Transgenic plants harboring an RNAi construct often have reduced expression 
of the gene-specific naRNA. The resulting plants may display either complete gene silencing^ 
thus having a knockout phenotype, or a partial "knockout" phenotype due to *leaky* expiessioti. 
The RNAi approachi should allow the suppression of GALATl expression and a reduction or loss 
of GALATl. This will allow us to elucidate the function of GALAT in pectin synthesis and in ' 
the plant. We will also continually monitor the T AIR' database to determine if any T-DNA insert 
lines for GALAT become available.. If so^ we will' order the seed and characterize the mutants as 
described below. 

. To generate GALAT knockout phenoQ^es using RNAi gene silencing, complementary gene- 
specific coding regions uiiique to GALATl (i.e. not found in the other gene family members) 
will jbe generated by fusing one sense region (-200 nt) to its identical region in the antisense 
orientation using GUS as a linker. A 700-bp partial coding linker DNA derived from the GUS 
gene will be separate the two complementary gene fragments. The DNA constmct 
(sense):GUS:(antisense) will be generated by PGR and ligated into a Bluescript-^based vector 
between the 35S cauliflower mosaic virus promoter and the octopine synthase temndnator. The 
resulting expression cassette will be ligated into a binary vector, the vector transfonried into 
Agrobacteriufn tumefaciens by electroporation/and the transformed A. tumefaciens used to 
transform Arabidopsis thaliana ecotypt Columbia by vacuum infiltration*^^ Tl seedlings will 
be selected on agar plates containing the appropriate antibiotic/herbicide and resistant seedlings 
will b^ transfeired to soil. T2 resistant seed will be collected from independent lines. Maor Bar- 
Peled, a colleague at the CCRC, routinely uses these methods in his lab will advise us as 
necessary. Transgenic homozygous plants will be selected and characterized (assuming they are 
not lethal). Of particular importance regarding pectin synthesis, the cell walls will be isolated 
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and analyzed for glycosyl residue composition and linkage to provide information about the 
possible role of GALATl in pectin synthesis. 

Future research. If time permits, we will initiate two types of additional studies. 

Identifying members of proposed HGA biosynthetic complexes. There is growing 
evidence that glycoconjugates are synthesized by compilexes of glycosyltransferases and other' 
types of proteins^^^. For example, ganglioside synthesis occurs via a tightly regulated formation 
of multiple glycosyl transferase coraplexes^^^ If time allows we will attempt to identify protein 
members of proposed biosynthedc complexes by immunoabsorbing such* proteins bound to 
GALATl using anti-GALATl antibodies or anti-HA epitope antibodies. The immunoabsorbed 
proteins would be identified by SDS-PAGE, removed from the gel, and their anaino acid 
sequence detennined by LC-tandem mass spectrometry. 

Determining the 3D structure of GALAtL Studies aimed at determining the 3D structure 
of expressed recombinant truncated GALATl, and full length GALATl would be initiated. 
A long-term goal of the research is to use GALATl (and other pectin biosynthetic enzymes) to 
synthesize pectins with specific structures. To do this an understanding of the 3D structure of 
GALATl and an identification of the amino acids important for nucleotide-sugar and acceptor* 
binding and for catalysis is needed: With stractural infomiation, directed changes in GALATl 
gene/polypeptide sequence can be made with predicted effects on substrate and catalytic 
specificity. Two approaches will.be used to gain information about GALATl 3D structure: X- 
ray crystallography and NMR. Ideally, we would like to have structural data on both the active 
truncated recombinant GalATs and on the fiill length enzyme. Whereas success is probable with 
the trancated forms, 3D structural detennination of membrane proteins is much more 
challenging; aiid NMR methods may be more useful with such proteins. The University of 
Georgia is a research core for the NIH-funded Southeast CoUaboratoiy for Structural Genomics 
(SECSG) fhttp://www.secsg.org/> which is part of the developing Georgia Stmctural Genomics 
Center. This facility includes a robotic high throughput crystal screening (HTP-CS) system in 
the XrRay Crystallography Core. which is headed by Bi-Cheng Wang (PI of the SECSG grant):. 
We will use the robotic system in collaboration with B.C. Waiig to generate GALATl crystals 
and determine GALATl 3D struchire (see letter of support from B.C. Wang). We will also 
initiate studies in collabpration with Jim Prestegard (Co-PI of SECSG, GCRQ who heads the 
. NMR Core of .SHCSG to attempt to use high-field NMR spectroscopy and computer based 
structure predictions to obtain NMR solution conformation data for GALATl . This will involve 
producing appropriate recombinant isotopically labeled GALATl and obtaining residual- dipolar 
coupling data from solutions of. the protein in the presence of substrate/acceptors. Success with 
both studies will require producing sufficient amounts (5-10 mg) of full length and truncated 
GALATl for the NMR and crystallography approaches. These will be generatted ft-om the 
proposed research. 

Results from Prior NRI Support 

in work supported by our previous USDA-NRI grants [9/15/1994-12/31-00] we identified 
and characterized a membrane-bound and soiubilized a-l,4-galacturonosyltransferase from 
tobacco, radish, pea and Arabidopsis*"**^''"^'*^-^-^^'*®^'**^ (see also progress report). 
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PARTII 

' Functional Identification of Genes for Pectin BiosynthetU: Enzymes 
Project Summary 
Ust of Senior Personnel 

Debra Mohnen, PI, CCRC and Department of Biochemistry and Molecular Biology. 
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Maor Bar-Peled, Co-PI, CCRC and Department of Plant Biology, University of Georgiai 
Michael Hahn, Co-PI,. CCRC and Department of Plant Biology, University of Georgia 
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William S. York, Co-PI, CCRC and Department of Biochemistry and Molecular. Biology, . 
University of Georgia 

. Summary of proposed project • . . ' . ' _ 

The enzyme activity pf 32 proteins encoded by putative pectin biosynthetic genes will be detemained. 
.Pectin is a family of structiirally pomplex cell wall polysaccharides that has multiple roles in pBant 
growth, development and disease resistance. Pectin synthesis is catalyzed by glycpsyltransferases that • 
transfer a glycosyl resi[due from an acdVated sugar onto an oligosaccharide/polysaccharide acoqptor. The 
' putative pectin biosyntfiedc genes were identified based on: (1) sequence similarity to the receatiy . 
identi^ed gene (GALATJX an Arabidopsis thaliana UDP-galacturonic.acid: HGA a-1,4- 
galacturonosyltransf erase (GALATl); (2) sequence similarity to KDOtransferases; and (3) seqosence 
. similarity to a putative glucuronosyltrarisferase. To identify enzyme activity (1) each of the gems will be 
cloned, heterologously expressed, and the expressed. protein tested in a multi-acceptor pectin 
biosynthesis assay fMAPA) : (2) the visible phenotype and wall pectin structure of mutant plaiass with T- 
* DNA inserts in the selected genes will be characterized-; (3) polyclonal antibodies to heterologausly- 
expressed pectin biosynthetic enzymes will be generated and used to define the subcellular locadon of ' 
the gene products and to inhibit specific pectin biosynthesis activity in permeabilized-membrams that 
contain' endogenous acceptors. The long-term goals are to identify and characterize genes invoBved in 
pectin biosynthesis. 

Broader Impacts of the proposed research 

The proposed research will lead to the functional characterization of Arabidopsis genes that catsdyze 
pectin synthesis. The geries and their confirmed enzyme activities will be available at anew wdblink 

. entiUed 'Tectin Biosynthesis NSF Arabidopsis 2010 Project" on the CCRC homepage 
(bttp://www.ccrc.uga.edu). . Acceptor molecules and cDNA clones generated during this study Will be 
made available to researchers through a 'Tectin Biosynthesis Resource Manager" funded fromaEiis 
proposal and using the infrastructure of CarboSource Services, a component of the NSF-funded.Plant 
Cell Wall Biosynthesis Research Network (http://xyloglucan.pri.msu,eduy). The identification of genes 
encoding pectin biosynthetic enzymes will allow the engineering of plants to produce pectins wieh 

.modified structures and properties, pectins with improved agricultural value, and pectinrbased 
neutraceutical and phanxiacological products. The participation of historically underrepresented groups in 
the research will be achieved by recruiting undergraduate and graduate students through the University of 
Georgia Summer Undergraduate Research Prograni (SURP) fwww.gradsch.upa.edu/rr/) and the mew 
SURP-Bridge program, which draw in minority students from across Georgia. The latter program offers 
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uicoming graduate students from undeirepresented groups lesearch experience prior to graduate school in 
an effort to develop the students* research skiUs and increase their likelihood of success in graduate 
school. 

L Specific Aims 

We have identified 32 genes in Arabidopsis (see Appendix A-1) that we predict encode 
glycosyltransferases involved in pectin biosynthesis. The putative pectin biosynthetic genes to b& 
investigated in this proposal include a family of IS putative GalAT genes and a family of 10 GaOAT-like 
. genes identified based on their sequence similarity to the functionally and biochemically identi&d 
GALATl (Sterling and Mohnen, in preparation). Two other families of genes were selected basecdion 
seqaence similarity to a putative RG-II gliicuronosyltransfecase^ (the GlcAT family) and to prokDcyotic* 
laDOtransferases,^ (the KdoT family). The goal of this proposal is to determine the catalytic acaSvity 
and substrate specificity of the proteins encoded by these genes. At least SO unique glycosyltransferases 
are likely to be required to synthesize pectin^. The number of pecdn biosynthetic glycosyltransfisases 
may be significantly larger if multiple copies of the enzymes are encoded^ as is likely since at least parts 
of the Arabidopsis genome appear to have Undergone duplication^.' * ' *. 

We recendy identified the Arabidopsis gene {GALATl) that encodes al,4- 
galacturonosyltransferase I (GALATl), a pectin biosynthetic glycosyltnmsferase. GALATl transfers 
galacturonic acid (GalA) from UDP-GalA onto homogalacturonan acceptors^****. Thus, GALATl encodes 
apectin biosynthetic glycosyltransferase. We will use the methodology we developed during tte 
i^ntification of GALATl to characterize the enzyme activity of the putative pectin biosynthetic genes 
- described in this proposal. All 32 genes will be (1) cloned, heterologpusly expressed as epitope-fiagged. • - 
constructs to facilitate purification, and when applicable as N-terminally truncated constructs, anqil 
transformed into Pichia and human. embryonic kidney cells CI^K293 cells). Enzyme activity of tfie 
expressed reconibinant proteins will be determined using a irailti-racceptor gectin biosynthesis asasay 
(MAPA) that allows screening for multiple pectin biosynthetic activities- (2) Antibodies will be 
g^erated agsunst each heterologously expressed protein. For proteins thai give no activity in the MAPA, 
WB will determine whether the specific antibody can inhibit pectin.biosynthesis in permeabilizetfl 
membrane preparations (permeabilized membranes assay, PMA) that have endogenous acceptons^^® 
rather than exogenous acceptors. Antibody-dependent inhibition of specific enzyme activity wiH allow 
us to identify the nucleotide-sugar and/or acceptor substrates for the expressed enzymes and allow us to 
optimize the aibceptor substrates used for MAPA. The antibodies will also be used to determine Ae 
subcellular localization of the gene products using tissues shown, by RT-PCR, to express the mRMA for 
the specific gene. The emphasis will be on immunogold localizations using electron microscopy; since 
the subcellular location of the genes will be .most informative regarding the possible role the .gene 
products in pectin biosynthesis. (3) We also propose to strucnurally characterize the pectin in the walls of 
Arabidopsis plants with T-DNA inserts in the genes of interest. Changes in pectin structure in the 
mutants would indicate the defect in wall synthesis, give information about the enzyme activity of the 
gene product, and allow us to isolate pectic fragments ffom the mutants for use as additional exogienous 
acceptors in the MAPA. To date, T-DNA insertion mutant lines for 19 of the selected genes are available 
in the Salk Arabidopsis T-DNA library fhttp://signaLsalk.edu/cgi-bin/tdnaexpress\ 

The genes to be tested in this proposal, and the confirmed en^me activity of the gene prodhncts, ' 
will be listed at a web link entitled 'Tectin Biosynthesis NSF Arabidopsis 2010 Project" to be established . 
at the CCRC web site (http.7/www.ccrc.uga.edu^ . Acceptor substrates and cDNA clones "will be irade 
available to other researchers through a ^Tectin Biosynthesis Resource Manager**, to be funded from this 
proposal and using the infrastructure of the established CacboSource Services, si component of the NSF- 
funded Plant Cell Wall Biosynthesis Research Network 

flittp!//wYw.ccrc.uga.edu/web/services/caTfaosource/index.htnilV The combined expertise of the PI and Co-PIs 
in pectin st^ucture"■'^ biosynthesis^*^'"'^^*' and function^^^, as well as carbohydrate chemistry, . 
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biochemistry, molecular and cell biology^^*^^ and immunoghenii^try provides a unique breadth and 
strength in multiple research disciplines and methodologies to tackle the challenging task of identi^ing 
pectin biosynthetic genes. The identification of such genes will allow the eventual engineering of piLnts 
that produce modified pectin, that have improved agricultural value, and will likely allow die prodiacction 
of pectin-based neutjraceutical^^;^ and pharmacological'^^*^^ products. 

n. Significance 

. Pectin is the most structurally comple:^ polysaccharide in the plant cell wall. It comprises S(D- 
40% of the primary wall of dicots and non-graminaceous monocots, and ^10% of the primary wall m the 
grass family. Pectins are a family of polysaccharides that include homogalacturonan (HGA) ^g. 
IX rhamnogalacturonan-I (RG-I) (Fig. 2) and the substituted galacturonans including the ubiquitous 
. dianmogalacturonan n (RG-II) (Fig. 3), xylogalacturonan (XGA)**^ and apiogalacturonan Pedtic 
polysaccharides impart structure^ to die growing plant cell wall, regulate ion binding in the wall, 
influence cell-cell adhesion^^**; are involved in cell signaling^^, and may control the diffusion of 
molecules through the apoplast^'. Pectins have been implicated in a broad range of plant growth 
phenomena including pollen tube growth^, seed hydration^*^, leaf abscission , water movement^*, and 
. fruit development". In addition, pectic oligosaccharides serve as signals^^ during plant developniffint^ " 
and induce plant defense responses^^*^. Mutant studies have shown that altered pectin structure leaib to 
dwarfed plants^, brittle leaves^, reduced numbers of side shoots and flower5^\ and plants with redmced 
cell-cell adhesion''^^: The sqccessful completion of the proposed research will lead to the identificaSDon 
of the enzyme activity, and thus biochemical function, of multiple pectin biosynthetic genes, many vff , 
which are likely to be involved in the synthesis of the backbones of the three pectic polysaccharides 
HGA, RG-I and RG-II. The availability of these genes would provide the molecular and biochemicaH 
tools needed to identify additional glycosyltransferases involved in branching of the backbones, and 
woiild allow the generation of plants widi altered pectin structure. While the 32 genes to be studied ik 
this proposal represent only '-0.12% of the «*23,000 genes in Arabidopsis, they ore some of the most 
•difficult genes to identify and characterize because of a lack of cpmmercially available acceptor 
substrates and activated glycosyl donor substrates. 

The GALATl gene has high sequence similarity to proteins expressed in other plants, thus it is 
likely that the other pectin biosynthetic genes will also have identifiable homologs in other plant spedes, 
including agriculturally important plants. Since pectin of overall veiy similar structure is present in Ifte 
walls of all flowering plants and gymnosperms, the identification of functional pectin biosynthetic genes 
will greatly facilitate the engineering of plants with modified pecdn and with altered growth . 
characteristics, some of which are expected to yield plants of increased agronomical value. In additson, 
mutant plants with defined changes in pectin synthesis will allow the dissection of the biological role of 
each pectic component in plants. The pectin biosynthetic genes will also provide valuable tools for 
understanding mechanistically how pectin is synthesized. The glycosyjtransferase-specific antibodies to 
be generated will also allow the process of pectin assembly in the.Golgi to be elucidated. A complete 
understanding of such a polysaccharide cellular trafficking process is unknown in any biological sysfism.. 

None of the'32 selected putative pectin biosynthetic genes are listed anriong the currently funded . . 
2010 projects except for the large s;cale 2010 projects by Ecker ("A Sequence-Indexed Library of Insertion 
Mutations in the Arabidopsis Genome Sequence-Indexed TDNA Library", NSI^Ol 15103) and 
EckeryDavis/Theologis ("Global Expression Studies pf the Arabidopsis Genome, NSF#0196098). AtlglSSSO is 
listed in the project entitled "Expression Profiling of Plant Disease Resistance Pathways" 
(NSF#pil4783) by. Dong, a project that includes a list of -2000 genes. It is unlikely that the Dong sflndy 
will test enzyme activity of AtlglSSSO and thus, our proposed research compliments- that of Dong. 

in. Relation to longer-term goals of the PI and Co-PIs 

The PI and Co-PI' s are faculty at the Complex- Carbohydrate Research Center and all have a lomg 
track-record in studying pectin structuIe"**^ function^^ and biosynthesis^'^""'*^*^. The long-term goals 



of the combined group is to understand the structute of the. complex family of pectic polysaccharides^ to 
decipher how wall polymers, including pectin, are synthesized and assembled into a growing wall, and to 
(ktermine the function of pectin in plants using physical/chemical, biochemical, genetic and molecular 
and cell biology approaches. Thus, for the PI and Co-PFs the current proposal represents a gene 
discovery effoil to supply tools (i.e. genes for biosynthetic enzymes) that will allow detailed mechanistic 
studies on theN>iology of plant cell walls. 

IV. Introduction 
JV,A. Pectin Structure 

. Pectin is a family of structurally complex polysaccharides that contain 1,4-linked CKrD- 
galactutonic acid residues^ Together they account for 30-40% of the primary wall of dicots and non- 
graminaceous monocots, and - 10% of the primary wall in the grass family. Pectin is comprised of 
homogalacturonan (HGA) (Fig. 1), rhamnogalacturonan I (RG-I) (Fig/ 2) and the substituted 
g^acturonans rhamnogalacturonan n (Fig. 3), xylogalacturonan (KGAY^ and apiogalacturonan4''^^ 
(see pdf file entitled "Pectin Review" at http://www.ccrc.uga.edu/'-dmohnen/nsf2010pdffeviewers.htnil> 
for summary of pectin structure' HGA accounts for 55%-70% of pectin*^. HGA is a linear 
homopolymer of 1,4-linked ocrD^galactosyluFonic acid that is often partially methylesterified at the C6 
carboxyl group and may be partially. acetylated at 0^2 and/or O-S' ' (Fig. l). RG-II is a structurally 
complex polysaccharide that accounts for approximately 10% of pectin^ ''^. RG-H has an HGA backbone 
Aat is substituted at 0-2 and/or 0-3. with four stracturally complex oligosaccharides'^*^'^ (Rg. 3). RG-II 
contains 1 1 different types of sugars including D-glucuronic acid (GlcA), D-Kdo (ketodeoxymannp- 
octulopyranosylonic acid), and D-Dha (deoxylyxo-heptulopyranosylaric acid). Rliainnogalacturonan I . 
(RG-I) accounts for 20-35% of pectin^ (Fig. 3). RG-I is a family of polysaccharides with an alternating 
[-44)-Qt.i?-GalA<l— >2)- a-L-Rha-(l-->] backbone in which between 20-80% of the rhanuioses are 
substituted preidominahtly by arabinan, galactan, or arabinogalactan side branches""'**^. Some of the 
side chains are terminated with GlcA or 4-0-methyl-GlcA residues^^*.^^. The composition and length of 
the side chains in RG-I varies betw^een cell types and in different plant species"* . The wall^' of some ' 
plants (e.g. Lenmd) contain apiogalactoronans (AGA) in which the galactoronan backbone is substituted 
at the 2 or 3 position by D-apiofuranose'*''^° Xylogalacturonans (XGA) have a backbone ftat is * 
substituted at 0-3 with D-xylose and are often found in reproductive' tissues""'^:*^*'. 
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03. Substituted galacturonans, such as RG-H, apiogalacturonan ^^^^^ ^ Represenaiivestmcture of rhamnogalacturonan I (RG^i). 

and xylogalacturonan have an HGA backbone. rci an altccnaiing («-»4VouD-Gal/7A-(l-»2). o-L-Rlu^d-^] 

backbone in whlch'rougMy20-809& of the rluinUioses ate substituted 
arabtniins, galacrans. or anbino|EalacCafls. 



Although pectins appear to have multiple roles in plants, the specific mechanism of action has 
not been detennlned. To determine the biological roles of pectins requires, the generation of plants with 
specific alteration's in pectin structure by knocking out genes that encode pectin biosynthetic enzymes.' 



Soch enzymes include the nucleotide-sngar biosynthetic enzymes and Uie pectin glycosyl^sfer^es. 
S^h Syltransferase is expected to transfer a unique glycosyl residue m a specific linkage in an 
^mS and Unlcage-specific manner onto a specific polymenc/oligomenc acceptor. • 
Snohistoche^cal pnd biochemical studies indicate that pecun synthes« . . 

Pectin bibsynthetic genes are most likely sequentially expressed in differort Golgi 

STe^ - in^ ord^S^which 'pectin is synthesized as it moves from the cis. through the medial an< 
to the G6I^^ . -^^^ 20O predicted wall biosynthetic glycosyltiansferases ha. 
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none of these encode 
pectin biosynthetic 
genes. We have 
recently used a partial 
purification-tandem 
mass spectrometry 
approach to identify tEi 
first gene (GALATi) 
encoding a 
demonstrated pectin • 
biosynthetic enzyme, 
GALATl (See below) 
The identification of 
this gene allowed us t 
.identify ^ putative 
GfiiAT gicne family ax 
a putative GalAT-like 
gene family in 
Arabidopsis. 



^ ^?^fu disLt enzyme activities are required to synthesize the nucleotide-sug^ ^^Sfe of " 
« HktwVrfvcosX methyl- and acetyl-transferases are required to synthesize pectin (for pdf file of 
^vfr^SgBi^^tLis-atht fr-^^^^^^^ 

?^n-9 unique galacturonosyltranferases are predicted to be required the syntfiesa of HGA^ R' 
Jrf posSwy for the synthesis of XGA and AGA'. Since, the GALTAi gene was identified ba« 
on^ p "se^e S'L GALA^ 1 protein in partially purified solubilized ^^^-^^'^^^ ^^^^f f 
Lotions that contained UDP-GalA;HGA al.4GalAT enzyme afetm^ 

"^^"^M^^^t^^nT^^^ has been identified and parUa] 

Memorane-Dounu 5 , ,^80^5 :„_iudin2 Arabidopsis (Sterling and Mohnen, unpublished . 

^s^rpS'cj-aiiT-sr^^^^^ 

Lra^dinH the first direct enzymatic evidence that die synthesis of HGA occurs m the Golgi - Jnv^tro, 

It^^Te L «rnAice radiolabeled products with molecular masses between 100 and 500 kd . 

Satle^SS ^thfpuri^^ e'ndopolygalacturonase generated GaU..cUgalacturoni 
IS Sactaronio acid, thereby coirfirming that the synthesized product was HQA^Thus. ntvttro 
S^^e^Cd^ the reaction: tTOP^alA + HGA(n) HGA(n-M) UDP. where HGA can be pu> 
wr r^HGA reS in pectin. The product produced in vitro in tobacco.microsomes was - 50% 



thai the degree of methyl esterification of newly, synthesized HGA may be species specific and that 
methylesterification occurs after synthesis of at least a short stretch of HGA, 

Enzymatically active GalAT is solubilized from membranes with detergent^ Solubilized GalAT ' 
adds Gal A onto the non-reducing end' of exogenous HGA (oligogalacturonide; OGA) acce^^ors of a 
degiees of polymerization 2: 10^. Solubilized GalAT from tobacco membranes^, or detergent- 
permeabilized Golgi from pea'^ elongates exogenous OGAs predominantly by a single GalA residue 
unless high UDP-GalA:OGA ratios are used':"****"» su^esting that solubilized GalAT in vitro acts 
nonprocessively, (i.e. distributively). The apparent lack of in vrVra processivity of GalAT was recently 
confirmed by Akita et aL^ who showed that a solubilized en2^me from petunia pollen can elongate 
OGAs in a "successive" fashion with up to 10 GalA residues, but the kinetics of the reaction suggested a 
distributive (i.e. non-processive) mode of action.. 

We identified a putative GalAT §ene from Arabidopsis using a partially purified euTyme together 
with tandem mass spectrometry. Arabidopsis GjdAT was partially purified frooni a detergentrolubilized- 
enzyme preparation by sequential chromatography over ion exchange and affinity resins. Fractions 
enridied in GalAT activity were treated with trypsin and the amino acid sequences of the pqptides 
determined by liquid chromatography-tandem mass spectrometry. These aminb acid sequences were then 
used to screisn the Arabidopsis gene/protein database. Thirty unique proteins were solely idoDtified. in the 
GalAT-containing. fractions. Aihong the 30 unique proteins that co-purified with GalAT activity, two - 
prot^s (designated JS33 and JS36) were identified as putative GalAT proteins/genes based on their 
having at least one predicted transmembrane domain and a precficted glycosyltransferase domain (see 
CAZy database: httD://afmb.cnrs-mrs.fr/CAZY/index.html\ 

JS33 and JS36 were either cloned by RT-PCR (JS36) using total -RNA from Aro^iWoj^iri^ tissue or 
a cDNA clone was obt^ed (JS33) from the Arabidopsis Biological Resource Center. The piroteins • 
encoded by these genes each have a predicted single transmembrane domain and a high pl as expected 
based on the biochemical characteristics of the GalAT enzyrne^*^*'. The genes were truncated to remove 
their N-terminal region* including all or most of the predicted transmembrane domain and the fruncated 
genes were mserted into the mairunalian expression vector pEAK, that contained a heterolo^us signal 
sequence, a hjstidine (HIS) tag, and two influenza hemagglutenin (HA) epitopes. 

The truncated forms of JS33 and JS36 in the pEAK vector were transiently expressed for 46 . 
hours in human embryonic kidney cells (HEK293 cells). The media was collected and incubated with a 
mouse anti-HA IgG that was bound to.Proteirt A S^pharose since the translational fu§ion protein 
constructs contained two copies of the HA epitope N-terminal to the predicted catalytic domain. The 
immunoabsorbed protein was assayed for GalAT activity. Table I shows that the JS36 constroct • ' 
expressed GalAT activity. These results show that jS36 is a GalAT and we have named the gene " 
encoding this enzyme GALATI (Sterling et al,^ in preparation) in accordance with the rules put forward 
by the ''Commission on Plant Gene Nomenclature (http://genome-www,stanford.edu/MendelA and The 
Arabidopsis Informatidn.Resource (TAIR) ^http://www.arabidopsis .or|a^nomencl.htnil: 
http://www.arabidopsi R nr ff/infQ/guidelinesJitnil#genes) . Stable transformed lines of JS36 and JS33 are 
currently being established to produce large amounts of heterologously expressed GALATI for further ~ 
study. 

Analysis of the gene sequence of GALATI shows that the predicted expressed protein contains a 
transmembrane domain. This is in agreement with the GalAT activity being membrane bound in all 
species examined to date*. Furthermore, the predicted topology of GALATI is that of a type-II . 
membrane protein, in agreement with our previous studies showing that the catalytic site of pea GalAT is 
in the lumen of the Golgi'°. Type-II glycosyltransferase membrane proteins have a short NTterminal 
cytosolic tail; a transniembrane region, a stem region, ^d a C-terminal catalytic domain^^. 

GALATI is a member of Glycosyltransferase Family 8 of the CAZy database [a database of 
putative and proven carbohydrate modifying enzymes that currently has 61 different glycosyltransferase 
families (httD://afmb.cnrs>mrs.fr/CAZY/index.html^ ^'^M. The presence of GALATI in Family 8 is 
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consistent with our demonstrated activity of GALATl as an al,4-galacturoiiosyltransferase, since Family 
8 members are retaining glycosyltransferases. GALATl is a retaining enzyme because the a- 
configpration in the substrate UDP-(X-GalAis retained in the' product al,4-linked-gaIacturononun (HGA). 

Table L Expression of recombinant putative GalAT In mammalian HEK293 cells. HA-epitoped tag^ 
truncated JS36 and JS33 genes inserted into the mammalian expression vector pEAK, along with a pEAK vector 
only control* were independendy transiently transfected into HEK293 cells. Transfected cells were grown for 46 
hours and the media was incubated with anti-HA IgG bound to either Protein A Sepharase. Immunoabsoi&ed protein 
was assayed for GalAT activity using UDP-l'.'^GJGalA and oligogalacturonides as subsliates. The data aie the 
average r^C]GalA incorporated into product (minus TO controls) from duplicate 45 min reactions. Time ©controls: 
reactions to which base was added .to inactivate the enzyme prior to the addition of substrates. Positive control: 
GalAT activity in solubiiized Arabidopsis protein. Similar results were obtained in two independent expouneAts. 



Enzyme 


['"CIGalA incorporated 


Recombinant truncated JS36 


161 : 


Recombinant truncated JS33 


30 


Vector conbrol 


0 


Solubiiized Arabidopsis enzyme (control) 


• 1434 



GALATl 16 expressed in all Arabidopsis tissues (flower, root, stem and leaf) we'examnned by 
IlT-i?CR (Sterling et aL, in preparation). Furthermore, 18 EST entries for this gene in the TAIR database 
(http://www.iarabidop5i5.org/) indicate that GALATl mRNA is expressed in Arabidopsis develq|»ing 
seed, green sUiques; roots and aerial organs. . • / * 

V. Proposed research 

V. A. Selection of putative pectin biosynthetic genes 

• VA.1. Identification of a proposed GalAT Gene Family . We propose that GALATl and She 14 
other Arabidopsis genes with high sequence sindlarity to GALTA are members of a GalAT fanssSsx and 
are involved in HGA, RG-I, RG-II and XGA synthesis. 

A standard protein blast and a PSI Blast of the NCBI protein database using the GALATl (JS36) 
amino acid sequence revealed that GALATl is one member of a 15 member putative GalAT gene family 
in Arabidopsis. The genes We selected for this GalAT family share ^ 30% amino acid identity and > 
50% amino acid similarity based on a PSI Blast. These genes were farther compared with 
GALATl using a Pairwise BLAST. The results indicate that this family of genes has > 34% identity and 
^ 52% simjilarity to GALATl in the portion of the genes C-terminal to the membrane spanningdbmain. 
This identity is comparable to the 37-54% identity shared among the proposed ten m&wbet Arab&iopsis 
fiicosyltransferase gene family (AtFUTl -10)^. 

GALATl is the pnly member of the GalAT family that has a known function.- However, the 
recent characterization of the mutant Quasimodo]^\ is in agreement with our identification of GALATl 
as a galacturonosyltransferase. The Gal A content of the walls of Quasimodo 1 is reduced by 25% and 
plants exhibit decreased cell adhesioxl^^ characteristics consistent with the possibility that the 
Quasimodo J gene encodes a GalAT. Quasimodol has 53% amino acid identity and 72% similarity to 
GALATl and the gene affected in Quasimodol is a member of our proposed putative GalAT family. 
There is as yet, however, no direct evidence that the protein encoded by Quasimodol is a functional 
GalAT. . . 

The conserved amino acids within the GalAT gene family are shown in Fig. 4. 
Glycosyltransferases typically contain one or mom carboxylates at the catalytic site. At least one of 
these carboxylates is expected to coordinate a divalent cation associated with the nucleotide-sugar Ixi 
many glycosyltransferases the inetal coordination involves two carboxylates that are often present as 



•DDx, xDD, or DDD (the so-called *'DQO motif)'^. We predict that D498 and D500 are part of die 
"DOO D* motif. However* structural and mutational data is required to confinn this. 

GALATl homologs are present in other plants, as would be expected for HGA GalAT. These 
include proteins in Cicer arietinum (chiclq)ca) (76% amino acid identity; 87% siniilarity), Orymsativa 

(japonica rice) 



H 0H~A-SW-5- 



C.W--0>«~DL 



W— GLG-* 



Figure 4. Schematic representation of the conserved sequences in the Arabidopsis 
GalAT gene family. The speckled regiQn represents the transmembrane spanning re^jon. 
The relative position of conserved ressidues represents numbering as for GALATl 
(At3g36ll30). 



(59% i&ntify; 
75% similarity) 
and P(jpulus alba 
(49% idfentity; 
72% simnilarity). 
In addition, 
tentati\e contigs 
encoding 
apparenlt 
ortholpgpes to 
GALATP, ; 
derivedfiroin 



EST sequencing efforts, have been identified in a number of plants including soybean, Medicaga 
.truncatula^ and cottori. 

V-A.2..GalAT-like family. A PSI Blast against GALATl identifies a further 10 genes tharalso 
-have high sequence identity (23-29%) and similarity (41-50% to GALATl and form a tight cluscer of 
highly similar genes (55-66% identity/67-77% similarity). A Neighbor Joining Tree of our proposed 
Arabidopsis GalAT Supecfamily (i.e. the proposed GalAT family and the GalAT-Like family)^ based on 

• a sequencjB alignment generated by 

• Figures. Arabidopsis GalAT SuperfamWy 



Clust^™, is shown in Fig.5- The 
10 GalAT-lifce genes are all 
significantly smaller, lacking -200 
amino acids in comparison with the . 
GalAT family. Nonetheless^ they 
appear to be targeted to the secretory 
pathway based on annotaticsoi of the 
genes at the Arabidopsis Information 
Resources. All 10 genes appear to 
be expressed in Arabidopsis^ since 
they are represented by one or more 
£STs in the Arabidopsis EST 
collection. The GalAT-like genes 
also .contain some of the same 
con^rved residues as the GalAT 

family, namely D-D D— L (the 

predicted "D(x) D" motif) and L 

.F w— GLG-i 

H^G— KPW. We group the 10 
GalAT-like genes into a famOy and 
propose that they encode GaSATs 
directly involved in pectin synthesis 
or GalATs with, as yet, unidentified 
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glycosylating function. 

V.A3. GlcAT family. A putative RG-H GlcAT (NpGUTl) was recently identified in Nicotiana 
plumhaginifolia^ based on the phenotype of a T-DNA transformed callus mutant that had reduced cell. 



adhesion and reduQtion in GIcA content in the wall and in RG-II. The tobacco gene has high seqnence 
similarity to a known glucuronosyltransfeiase in vertebrates. No enzyme activity for the expressed 
.proteia was reported, although chemical analysis suggested fliat the gene may encode a GIcAT involved 
in RG-n synthesis. We propose to test whether the apparent Arabidopsis homolog. AtSg61840, emcodes 
a functional RG-n:6lcAT. Tvfo Arabidopsis genes have high sequence identity (93%) and simifapty 
(95%) to NpGUTI and two other Arabidopsis genes have 42% identity and 62% similarity tp NpGUTL 
We propose that these foor genes utilize UDP-GlcA as nucleotide sugar and transfer GlcA specifically to 
RG-n orRG-L We group the genes as the putative pectin GlcAT family. This family shares the 
• following three conserved sequence modules: S-VRT-NPEEADWFY-PVYTTCD; PYWNRT-G-IDHFF- 
-.kDFGACFHY-EEKAI-G; R^IFCLCPLGWAPWSPRLVE-VIFGCIPVIIADDI-I^^ 

VAA Putative Kdotransferase family. Kdo and Dha are present in plant RG-H, some algal walls* 
and bacterial lipo-and exo-polysaccharides^. The genes that encode bacterial ICdotransferases have- . 
conserved sequences^'^^ and use CMP-Kdo as the activitated sugar donor. We identified homologiDUs 
sequences in Arabidopsis and rice. These ^nes are members of CAZy Glycosyllransfera$e Fami^SO. 
Using RT-PCR we cloned and sequenced a putative Arabidopsis KdoT gene (Hogans» Harper, an*Bar- 
Peled unpublished results) that encodes a putative membrane bound protein. Th^ Arabidopsis gens: shares 
33% amino acid sequence identity and 49% similarity to the functional KdoT from proka^yotes**. 

Vertebrate sialyltransferases (CAZy Faihily 29) are^ a group of glycosyi transferases that utilize 
CMP-addvated sugars. Oligosaccharides containing sialic acids are not present in plants. Never&eless, 
sialic acid is structurally similar to Kdo and Dha. A 2,6- and 2,3-sialyltransferase from rat liver^^ anid* 
porcinie submaxillary gland^ have a common 55 amino acid sequence ('^sialyl motif . 
(C AWGNSGTLLNSQY GDLroKHEIVIRIJWAKTE-RFEKKVGSKT^^ presumably tocated 

in their active domain. Arabidopsis contains two gpnes that are grouped in the same glyco'syltransferase 
family as the sialyltransferases (CAZy 29). The two Arabidopsis genes encode putative membrane bound 
proteins and have the same 55 amino acid domain present in mammalian sialyltransferases. Thus, we 
propose to determine if the two Arabidopsis genes in CAZy Family 29 are putative Kdotransferases: 

V,B. Summary of scheme to identify gene Junction 

We propose to take three complementary approaches to define the function of the putative pectin 
biosynthetic genes (Fig. 6). (1) Each gene will be hetei;ologously expressed and the expressed proieins 
assayed for their ability to transfer radiolabeled sugar (from a nucleotide-sugar) to a series of oligo^and 
polysaccharide acceptors using a multi-acceptor pectin biosynthesis asaay (MAPA). (2) Antibodies will 

be generated to the expressed 



gene 
in mammalian 
and Fichia 
systems 



Candidate Gene 
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acceptors 
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Figure 61 Summary of scheme to identify pectin biosynthetic genes. 



protein and used for subcellular 
immunocytochemistry to provide 
information on whether or not the 
location of the gene product 
supports a role in pectin syntbesis. 
These antibodies will be also be 
used in a permeabilized mend>rane 
assay (PMA) to inhibit transfer of 
radiolabeled sugar to endogenous 
acceptors. (3) The pectins in the 
cell walls of available niutants for 
each geiie will be structurally 
characterized. A change in pectin 
structure would suggest the point - 
at which pectin synthesis was 
effected and thereby provide 
infonnation on enzyme activity of 



9 



the gene product. Pectin acceptors from such mutants would be isolated and used in the MAPA if 
ejizymadc activit/ had not already been confinoed in (1). If enyzme activity for the expressed protein 
was not (^tainedv even using mutant acceptors in the MAPA, a permeabilized membrane assay (PMA) 
using membranes from the mutant plant and an aliquot of the heterologously expressed protein wouDd be 
undertaken to attempt to add radiolabeled sugar onto the mutant acceptor and thereby complenient Ae 
pecdn synthesis defect in. the mutant membranes. The specific methods to be used are described below. - 

V.C, Cloning of the genes ' - ' 

The selected genes will be cloned by RT-PCR using RNA from Arabidopsis root, stem, leavcsy 
flower and suspension cultures (depending on where highest speciflc mRI^A levels are expressed), or the 
cloned genes will be obtained from the Arabidopsis Biological Resource Center (ABRC) if they are 
already available. The genes will be inserted into the vector p.CR2.1 to allow easy manipulation into 
other expression vectors and to facilitate DNA sequence determination. Standard procedures includbig 
PGR and restriction digestion comparisons will be used to confirm gene identify. Several of the PIsBave 
extensiveexperience with fliese techniques^ 

V.D. Heterologous expression 

We have showil (see section IV JB.) that human embryonic kidney (HEK293) cells transiently 
infected with recombinant truncated GALATI expressed active recombinant GALAT enzymie. Wheseas 
transient expression systenti are sufficient to measure activity of an expressed protein, alternative 
expression strategies are lequixed to produce proteins in quantities sufficient for further characteriza&ion 
of the enzyme and for antibody production (~100 iig/immunization). One strategy will be to produce 
stable transfected clonal HEK293 lines*°* expressing each of the selected genes. 'Truncated versions of 
the genes will be made by removing.the N-terminus and transmembrane domain as was down for 
GAIATl- The g0ne constructs will* contain an N-termihal epitope-tag since such GALATI constructs- 
e>cpressed in mammalian cells Were active, and since numerous type II membrane glycosyltransferases 
and glycosidases have been shown to retain activity with N-terminal tags^^ -"^ and N-terminal 
truncations^*'^. An alternative stiatogy will be to express the full length and truncated forms of the 
genes in Pichia pastoris. Although both Pichia and mammalian cells can be used to express plant wall 
biosynthetic enzymes^* it is not possible to predict, a priori^ which of the systems will produce floe 
highest amounts of functional enzyme. Thus, we will initially make constructs in both systems. 

Kelley Moremen, a facul^ iriember at the CCRC routinely expresses proteins in mammalian . 
cells and has advised us on the transient expression of GALATI in mammalian HEK293 cells. He is also 
advising us on our current production and cloning of stable GALATI transfected cell lines, and will 
continue* to advise us for this proposal (see letter of support). The basic protocol will be as in the Edge 
BioSystems protocols with the modification suggested by the Moremen lab. 

. For expression in P. pastoris, cDNA encoding the entire, and the truncated soluble forms, of 
putative pectin biosynthetic enzymes will be generated by PGR using gene/vector specific primers. The 
PGR products will be subcloned into an appropriate Pichia expression vector in which the cDNA is 
inserted downstream from an alcohol oxidase (AOXl) promoter (Invitrogen). We are preparing GALATI 
constmcts in the Pichia and will determine whether functional C-terminal epitope-tagged constructs ttiat . 
do not effect OalAT activity can be recovered. If activity is recovered, we will make similar constructs 
for the other putative pectin biqsynthetic genes. Pichia will be transformed with the linearized 
recombinant constructs and expression of the recombinant protein will be induced by growth in 
methanol. Cells will be plated and screened for growth under selection and the selected colonies will be * 
assayed for enzyme activity in the MAPA. Approximately 10-20 colonies enzyme expressing clonal 
lines will be quantitatively assayed for their respective enzyme activity to identify the highest expressing 
line. Numerous glycosyltransferases'^'^ and plant enzymes have been functionally expressed in 
Pichia, and one of the Co-PIs (Bar-Peled) has functionally expressed nucleotide-sugar biosynthetic 
enzymes (UDP-Xyl synthase, UDP~Rha epimerase/reductase, UDP-GlcA epimerase) in Pichia, 
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ExpressicMD in Pichia is more cost-effective than expression in manunalian cells and can generate- hig^ 
levels of expressed protein (^1 gmU*). Once a high-enzyme expressing line is recovered, lar^e amounts 
of protein will be produced in fermentors (available at the CCRC and at the University of Georgia 
Fermentation Research Facility) or spinner flasks and related eqiiq)ment used for scale up of mammalEan 
cell culture (available at the CCRC). 

V,E, Multi-acceptor gectin biosynthesis assay (MAP A) 

V-E-1- Strategy, T he basis of the MAPA is to rapidly determine if an expressed gene is a pectim 
biosyntbetic GalAT, GlcAT, or KdoT.. However, synthesis may require the presence of additional dooor 
and acceptor substrates since pectin is a complex polymer and some of the selected genes may encode 
polymer backboiie synthesizing enzymes. For example, a GalAT that is specific for xylogalacturonam 
(XGA) synthesis may require flie presence of UDP-Xyl in the reaction while RG-I backbone synthesis 
will likely require both UDP-GalA and UDP-Rha: Thus, our strategy for the MAPA is to initially (est 
each heterologously expressed gene in a minimum number of reactions that contain a mixture of 
substrates and acceptors so as to maximize the likelihood of recovering enzyme activity. Our initial 
strategy will be to test each expressed protein in four reactions that each contain a single radiolabeled • 
nucleotide-^gar, a mixture of non-labeled nuCleotide-sug?u^s, anil a mixture of acceptors (see Table BtJu 
. However, we anticipate that as we identify more pectin bipsyhthetic enzymes and g^in a better . 
understanding of their requirements, the specifics pf the number of reactions used in the initial screenu 
and the reaction components will change.. The basic enzyme buffer, reaction components, and pH will be 
those developed for GalATs in general and as adapted for Arabidopsis GALATl (Sterling and 
Mohnen, unpublished). The conditions to be used are applicable to numerous plant wall 
glycosyltransferases^ The initial assay will be a precipitation assay developed by the PI ^ that should B»e 
broadly useful for pectin biosyntbetic enyzmes. However, two other assays developed by the PI, a filler 
assay*^ and a non-radiolabeled PAGE assay' "will be used if warrented. The PI and a Co-PI have 
extensive bibchernical experience with multiple glycosyltransferases.^'''^^'•".^""^ pectm biosyntbetic 
metbyltransferases '^•'^•"^ and nucleotide-sugar biosyntbetic enzymes »9^;86.n4.ii6 . ^ 

If our predicted function of the selected putative pectin biosyntbetic genes is correct, the selected 
genes require UlDP-galacturonic (UDP-GalA), UDP-glucuronic acid (UDP-GlcA), CMP-Kdo, and UI»^ 
rhamnose (UDP-Rha) as activated sugar substrates. Of these, only UDP-GlcA is commercially availaWfe. 
The remaining required sugar nucleotides will be synthesized by methods already established in our • 
laboratories at the CCRC. The PI routinely synthesize UDP-[^^C]GalA^ ®^'^ '' either by the 4- 
epimerization of UDP-[^'*C]GIcA" or oxidation of UDP-t^'^ClGal'" A Co-PI (Bar-Peled) has developed 
. methods, to enzymatically synAesize UDP-Rha*"'"^ and more recently identified, cloned, and expressed . 
recoiribinant plant genes (Harper and Bar-Peled, unpublished) in K colt that allow facile enzymatic 
generation of TDP-Rha and UDP-Rha (Watts and Bar-Peled, unpublished). We have produced CMP-Kdo 
by cloning and expressing two recombinant enzymes that convert GTP, PEP and Ara to CMP-Kdo 
(Hogans and Bar-Peled, unpublished). The PI is director of CarboSource Services, a part of the NSF- 
lunded Plant Cell Wall Biosynthesis Research Network which produces and distributes UDP-Xyl (see 
sefctionVU). 

V.E.2. Multiple acceptor pectin biosynthesis assav rMAPA), Heterologously expressed 
recombinant proteins will be assayed for pectin biosyntbetic activity by determinitig whether they 
catalyze the transfer of a radiolabeled sugar residue to any one of several pools of pectic acceptors (Table 
II). Acceptor preparations will be generated by treatment of homogalacturonan, RG-I and RG-II that wc * 
routinely isolate'**'^^ with mild acid, hydrolytic enzymes, or combinations of these reagents (Table II). 
These treatments selectively cleave specific glycosidic bonds, thereby exposing potential acceptor sites 
to which glycosyl transferases might transfer a glycosyl residue. Monosaccharides and oligosaccharides 
(DP < 5) will be removed from the resulting mixtures by gel-permeation and/or ion-exchange 
chromatography. Some of the pectic polymers obtained will be further fragmented. For example, RG-1 
(Table n,. prep 6) will be treated with acabinosidase/arabinanase. Ion-exchange chromatography will be 
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used to separate the partially degraded arabinan sidechains (prep 8) from the dearabinosylated RG-I (prep 
7), which will be treated with RG-hydrolase to cleave the backbote, thereby generating desural^inosyhted 
RG-I fragments (prep 12). 

The structaral features of each acceptor sobstrate preparation will be characterized by standard 
methods that have previously been developed and applied in our laboratory*'**"'^* For examplej, the 
molecular weight distribution will be determined by SEC. Glyco^l residue compositions will be 
determined in triplicate by GC-MS analyses of alditol acetate (neutral residues) and TMS methyl 
glycoside (acidic + neutral residues) derivatives. Glycosyl linkage compositions will be performed by 
GC-MS analysis of partially methylated alditol acetate (PMAA) derivatives. The ratio of 2-linked to 
terminal GlcA and the ratio of 4-linked to terminal GalA will be determined by chemically reducing 
uronic acid residues to their 6^6-dideuterio hexosyl derivatives, which will be converted to PMAAs that 
are readily identified by GC-MS. . .. 



Table TL Acceptors to be. used for the multiple acceptor pectin biosynthesis assay (MAPA). Boxes indicate thefbur 
acceptor pools that will be used in die initial screening assays. For example. Pool 1 will contain Preps 1-4, 



Pool 


Prep 


Source 


Treatment 


M^or Component Produced far 
use as an Acceptor Substrate 


1 


1 


Commerciai'* 
polygalactiironic acid 


None 


Polygalacturonic acid 


2 


Commercial pectin 


None . 


Methyl-esterifled HG 


3 V 


Commercial 
polykalactuonic acid 


£/zdl9-poiygalacturonase 
(partial) 


Oligogalacturonides (OGAs) 
(DP 3-23) 


4 


Commercial pectin 


jS>uf9-polygalacturonase» 
pectin methylesterase. 
(partial) 


Randomly methyl-esterified 
oligogalacturonides (DP 3^23) 


2 


5 


A. ihaliana seed mucilage 


No treatment 


RG-I backbone 


6 


Cuitived plant cells and 
tissues 


Ent/o-polygalacturonase 
andSEC 


Highly branched* native RG-I 


7 


RG-IfromPtep6 


Arabinanase (or mild 
acid) and ion-exchange 


Dearabinosylated RG-I 


8 


RG-I from Prep 6 


Arabinanase (or mild 
acid) and ion-exchange 


Partially degraded RG-I arabinan 
sidechains. 


9 


RG-I from Prep 6 


j&j^galactaiiase and 
ion-exchange 


Degalactosylated RG-I 


10 


RG-IfromPtep6 


i?/u/a-galactanase and 
ion-exchange 


Partially degraded RG-I galactan 
sidechains 


3 


11 


RG-I backbone from 
Prep 5 


RG Hydrolase 
+/- rhamnosidase" . 


RG-I backbone fragments ' 


12 


Dearabinosylated RG-I 
from Prep 7 


RG Hydrolase ' 

+/- rhanmosidase^ 


RG-I fragments* depleted in 

arabinan sidechains 


13 


Degalactosylated RG-I^ 
from Prep 9 


RG Hydrolase 
•f/- ihamnosidase'^ 


RG-I fragments' depleted in 
galactan sidechains 


4 


14 


Cultured plant cells and 
tissues 


Endo-polygalacturonase 
and SEC 


RG-n 


15 • 


RG-n from Prep 14 


Mild acid 


Partially fragmented RG-II 



■RG-hydrolase cleaves the RG-I backbone, generating fragments witii a GalA residue at the reducing terminus. In 
the absence of ihamnosidas9» die non-reducing terminus of each fragment will be rhamnosyU but in the presence of 
rhanmosidase. the non-reducing terminus will be GalA 



Each acceptor substrate pool (Table H) will be mixed with a radiolabeled nucleotide sugar, a 
mixture of non-labeled nucleotide-sugars, and an aliquot of the recombinant gene product Following . 
incubation, acceptor and reaction product? (including those that have been radiolabeled by the action of 
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an expressed glycosyltransferase) will be precipitated with methanohchloroform^*" and non-incoiponfied 
nucleotide-sugar will be removed by repeated washing with aq 65% ethanoP'^. Glycosyltransferase 
activity wjll be detected by the presence of radioactivity in the pellet.. The nature of Ae 
glycosyltransferase reaction catalyzed by a given exayme will be deduced by examining increases in 
alcohol-insoluble radioactivity for different donor and acceptor combinations. An enz^e that transfos 
a glycosyl residue to a pectic pplymer that has stnictural features conimoh to more than one of the 
acceptor substrate pools will produce an increase in alcohol-insoluble radioactivity in more than one off 
the reaction mixtures. As the overall structural features of the components of each substrate pool willlbe 
known, this will provide information regarding the nature of glycosyltransferase activity, which can tben 
.be further characterized by peiforming glycosyl transferase assays using individual relevant 
acceptors/acceptor pools. Ultimately, highly purified acceptor substrates will be used and the radioacHDve 
|)roducts will be chemically characterized. 

V,F, Characterization of putative pectin biosynthetic mutants 

V.P.1. Strategy . Nineteen of our selected genes have one or more sequence-indexed T-DNA 
insertion mutants currently listed in the Salk Institute Genomic Analysis Laboratory (SIGnAL). 
Arabidopsis T-DNA mutant collection fhttp://signal:salk.edu/cgi-bin/tdnaexpress') . We expect that 
additional T-DNA mutant lines will become available prior to, and during the time the research proposed 
is carried out, since the Arabidopsis sequence-indexed T-DNA insertion Project is only beginning its 
second year of iiinding. We will obtain these mutants as they are deposited. We haye already receive^, 
grown, and selfed eight of the T-DNA mutants and the additional available mutants have been ordered 
fromABkC. ... 

The putative pectin biosynthesis mutants will aid in the identification of gene function in two 
ways. The visible phenotypes of such mutants may provide information on the biological function of fflte 
gene (if there is no redundancy in gene functiori) by demonstrating when during growth and development 
the particular gene product is needed. Structural analysis of the pectin in the mutant walls may provide 
information about the specific enzyme ac'^vity of the gene in pectm synthesis. Furthermore, the mutants 
can aliso provide precursor polysaccharides that could be used as exogenous acceptors to identify enzyme 
activity. For these reasons, the available putative pectin biosynthetic mutants will be characterized for 
plant phenotype and wall stmcture and, when applicable, the pectic polysaccharides from such mutants 
will be isolated for use as exogenous acceptors. Finally, when applicable, particulate and soluble 
fractions from the mutants will be used for in vitro complementation enzynne assays to confirm the 
enzyme activity of the expressed gene. 

VP.2. Initial characterization of mutant phisno tvpes and bulking up of seed. A portion of tfie 
initial mutant seed obtained from ABRC (segregating T3 line, see http://signal.salk.edu/tdna_FAQs.htnii) 
will be grown and allowed to self-cross to increase the seed stock (T4). Multiple plants from T4 seed of 
each line will be grown and the presence of the T-DNA insert determined by PCR of plant genomic DNA 
using a T-DNA primer and a gene specific primer. The same DNA will be analyzed with gene specific 
primers that should span the T-DNA insertion site. These analy§eis shpuld indicate whether the given 
plant contains a T-DNA insert and if so, whether it is homozygous or heterozygous for the mutation. If * 
necessary. We will use Southern blotting and hybridization with the specific genes, as reconunended by 
the SIGnAL web site» to detennine if the gene contains the expected T-DNA insert. 

Seed homozygous for the T-DNA insertion (when not lethal) or heterozygous (when no viable 
TDNA homozygous plants are obtained) will bis selfed to amplify the seed and, for heterozygous plants^ 
to test for segregation of any phenotype or T-DNA insert. If the parent plant was heterozygous for the T- 
DNA mutant, we would expect approximately a 3:1 mutant segregation in the next generation. If it was * 
homozygous, we should $ee a 100% mutant phenotype. Plants will be scored as heterozygous or 
homozygous by PCR analysis of the T-DNA insert and by any visible phenotype. Homozygous or 
heterozygous plants will be used for growth phenotfype and cell wall analysis. The seed will also be 
crossed with wild type Columbia and thei) selfed to eliminate the possibility that the lines contain an 
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unexpected mocation or additional T-DNA insertCs). All growth of npitant and wild type plants will be 
under controlled temperature, humidity and light conditions in growth chambers (Russell Malmberg, 
Plant Biology Dept. has agreed to advise us on the characterization of the mutants, see letter of 
suppport). 

V.F.3. Growth Phei^otype analysis . Several growth parameters of the mutant and wild type plants 
will be recorded to yield a general phenotypic characterization of the mutant plants. We will limit our 
initial phenotypic analyses to soil grown plants and use the growth stage designations of Boyes et al.^^. 
Bi-weekly measurements will be taken of seed germination, cotyledon and hypocotyl emergence, numba 
of rosette leaves >1 mm, plant height, inflorescence emergence, number of flowers, number of flower * 
stalks, number of siliques, number and length of siliques at end of flowering'stage, and number (mass) cd 
seeds obtained. 

VJFA Analvsisof Cell Walls. H omozygous or heterozygous plants will also be grown and 
analyzed for wall composition and linkage. Cell walls will be prepared as alcohol insoluble residues. 
(AIRs) from WT and (homdzygqus) mutant Ardbidopsis plant tissues^^*. AIRs will be prepared by 
homogenizing leaves and stems (from soil-grown plants) and roots (iGrom liquid-cultunsd plants) in aq. 
80% EtOH followed by washes with abs EtQH, chloroform-methanol, and acetone. Separate fractions 
conta]ning.RG4» RG-II and oligogalacturonides will be obtained by size-exclusion chromatography 
(SEQ and ion exchange chromatography of the material solubilized from the cell walls by treatment wiffi 
pectin methyl esterase (PME) and cn^/a-polygalacturonase (EPG). The yields, glycosyl residue 
compositions, and glycosyl linkage compositions of each fraction will be determined^^ 

v. G.- Production and characterization of putative acceptors from mutant walls . ' 

Cell walls and pectin sub&actions containing RG-I, RG-:!! and homogalacturonans will-be prepared 
as described above in section .VJF.4. Specific pectic subfructions that will be used as acceptor substrata 
will be prepared and characterized as described in section. V.E.2. (see Table IV). 

V,H. Permeabilized membrane Assay 

If no enzyme activity is discernible in the MAPA* membranes from wild type aind mutant plants 
will be isolated, suspended in reaction buffer containing 0.1% Triton X-100 to permeabilize the 
membranes '^"^^^^ and incubated in the presence and absence of both the heterologously expressed 
protein and the specific antibody against .the protein. Inhibition of the transfer of a radiolabeled sugar 
(e.g. from UDP-X'^C]GalA) onto endogenous acceptor in membranes incubated with the antibody^ 
compared to activity in the absence of the antibody, would indicate that the protein encoded a pectin 
biosynthetic enzyme specific for the endogenous acceptor. Recovery of activity in mutant permeabilized 
membranes to which the heterologously expressed wild-type protein was added (i.e; in vitro * ^ 
recon^timtion) would confirm this activity. Pectic acceptor molecules would then be generated from .the 
mutant plant ceU walls and used to attempt to obtain activity in the MAPA. 

V J. Generation of antibodies • 

Polyclonal antibodies will be generated in two ways to enhance the likelihood of obtaining useful 
specific antibodies for each gene product: Our main strategy will be to use purified recombinantly 
expressed proteins to generate rabbit polyclonal antibodies at the University of Georgia Polyclonal 
Antibody Production Service, The recombinant proteins used for immunizations will lack any putative 
transmembrane domain^ and will be generated as described in section IV.B.* Antisera that are obtained 
will be cleared of antibodies against the epitope tag sequence(s) incorporated into the heterologously 
expressed proteins by imanunoabsorption against synthetic epitope tag peptides (to be synthesized at 
Molecular Genetics Ihstrumentation.Facility, University of Georgia) coupled to a solid support. 

The specificity of the antisera will be tested by immunoblotting against the full series of purified 
heterologously expressed proteixis. In addition, the antibodies, will be tested for their abilities to 
specifically bind to native proteins as assayed by immunoassays against extracts prepared fit>m 
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Ambidopsis tissues. We will also test the antisera for their abilities to inhibit the activity of any ensgrnpas 
for which i/i vdn> activity has been demonstrated. 

Given that the proteins within the different femilies being stndied have domains showing strong 
sequence similarities/identities (e.g., the GalAT superfamily, see Figure 7). immunizing with the . 
complete heterologously expressed proteins may not yield monorspecific se^ Thus, we will also 
immunize rabbits with an eight-branched, multiple antigenic protein (MAP) synthesized based on 
peptide sequences derived those portionaof the proteins that showlittie if any sequence conservation 
amongst the members of a given protein family (e.g. the M^terminal portion of the C-temrinal domams of 
the GalAT superfamily). Colleagues at the CCRChave successfully used this approach to generate 
specific antisera against Cochliobolus endopolygalacturonase . The MAPs will be synthesized at Ae 
Molecular Genetics Instrumentation FacUity, Univeisity of Georgia, using tiie 8-branch Fmoc resm . 
The strength of the MAP system is its ability to generate specific anti-peptide antibodies, but the abiUty 
of the MAPrgenerated antibodies to bind to intact proteins may be more limited. Thus, it may prove 
necessary to make multiple MAPs based on different non-ponserved peptides and detenmne which, if 
any, of them recognize the complete proteins. 

V J Immunocytochemistjy:SubceUular localization of putaHve pectin biosyhOteric enzymes 

We propose to use immunocytochemistry tp detemune tfie subceUular localization of each of the 
pectin biosynthetic proteins within Arabidopsis cells from a tissue shown by RT-PCR to express tiie 
specific protein of interesL Such localizations will provide additional information with vhich to 
determine the role(s), if any, of the various polypeptides in pectin biosynthesis. For example, 
localization of a polypeptide to Golgi cisterpae would be consistent with a function of fliat polypeptide.m 
pectin biosyntiiesis synthesis. Localization of a polyp^tide to anotfier cellular compartment (e.g., the 
vacuole) would indicate that the polypeptide is unlikely to have a rote in pectin biosyntiiesis. In tiie latlss 
case, die polypeptide.and corresponding gene wiU be given a lower priority fpr the furtiier detailed 
studies that are outlined elsewhere in tills application. ^ 

Botfi developing Ara&wfopjw seedlings and growing suspension cultures will be used, smce bofflu 
tissues contain actively dividing and elongating cells tfiat are syntiiesizing walls. We have extensive 
exoerience wiUi the preparation and use of tfiese tissues for immunocytochenucal studies bqtii at the 
cellular and sub-cellular levels^'" "'^"'. Tissue sections will be prepared from four day old Arabidopsii 
seedlings or exponentially growing suspension cultured Arafeiac?;»m cells, and immunolabel^ 
described" Iraraonolabeled sections will be examined at 80kV with a Zeiss EM 902A Electron 
microscope available for our use in tiie Plant Biology Department at tiie University of Georgia. 

VI. Broader impact of results including how project wiU promote teaching and training and participation of 

underreprtoented groups . j j v j -n 

The PI and Co-PI's routinely advise multiple undergraduate students m undergraduate researcher and will 
continue to do so through the fesearch described in this proposal . For example, the PI has mentored 30 
^deigradukte students over the last 12 years. Seven of these students were minonty students. It is expected that 
each PI and Co-PI will mehtor at least one undergriiduate student in independent research on the proposed project Sm 
the duration of the proposed research. Since most biology-based majors atUGA require at least one s«nester of 
independent research, and since the Georgia Hope Scholarship program pajfs iiill toiUon for any Georgian student 
who maintains a B average in coUege, there is a vast pool of bright and talented students who wish lo cany out 
undergraduate research. Although we have only budgeted for 5 undergraduate students/yr. we anUc.pate traimng • 
more students since many students do one or more semesters of undergraduate research for. academic credit 

To increase the participation of historically underrepiesented groups in the research pool, we have also 
arranged witfi Curtis D. Byrd (Director of minority Recmitmcnt and Retention. University of Georgia) to recniit 
ftomlhe pool of outstanding undergraduates from historically underrSppresented ©roups that come to UGA each 
summer through the Summer Undergraduate Research Program (SURP) ( www rradsch.URa.e<|p/)rr/) . Tlie students 
will work vrith the PI. Co-PIs. graduate students and/or postdocs to cany out intensive summer research. nie SUIU' 
students wiU also be expected to write a paper and present a poster describing their research project at the end of the 
research eiqierience. 
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,„ ^ , — ™,fh riirtis D Bvrd to recruit one or more inconung minority gtadoate studentsfnto 
We have also ^^nged w,th Cuijs U^ayro n,is program funds accepted mtaority gradrnte 

the project through *e "^-Ij^J-^^^tte^^^^^ rScarch at UGA devdbp researdfc 

srsoSt'is^i^^s^^^^^^^^^ 

Vn. Results from Prior N§F Support 

(Co-PD: 2/15/0l-2n4/06: $559,867 *°f^fj"jv??t the partially support the acquotio: 

the goals of the grant, that ^'^^^^f^^^'^^^^^^X^^^^^^ ^edo this. «r 
of substrates and a^eptors for^me^^ 

estajbUshed "CarboSource Services V^apM^^^&^ available Is* the plantccell 

Coordinator^was ^^^^^^^ and'dlstributed to 24«»earch g.up 

wall biosynthesis coimnuniQr. We have P"»?"=^ J;^" ^ oroduction of UDP-GalAand die 
a.oughouc*ej^rtdj.ea^^^^^ 

producuon of UDP-Ara for eventuM aismo ^ Arrowhead Conference Center are out&cd 

r^^ESSw^s^*^^^^^^ 

^ iMCW r rant MCB-9974tf73i 08/01/99-07/3^03; $330,000; Dynamic models ofxylogluemt conforn^c 
(York) NSF Grant ^ characterize the molecular and topological psocesses 

in the P^^.^^^;^^°([^^^^7S^^^ a loed4«ring net,vork in the prin.ary ceU w^l. We 

govermng the 'n^®'^"" °f .''^'"Stac^^ technically chalfengjng. raultidisciplinaiy pwgect. We 

performed experiments that ^^V^ff tfre sensitivity niTtt 

developedmethod^toprepaie ^-ennched plant On on plants grown in die presense 

heteronuclear NMRexperiments «'^«»y,*^'^f°f™f.^^„i^^^^^^ xyloglucan andcello-oligome 

simulations for several °f*??°^f^^^^^^Z^confonnBti^ characteristic of cellulose and. 
oUgoglycosrdes have a net "f^-^^^ ^^" SgSbus experimental evaluation of this result is being perform«Et 

j^r^jis^aTod^^^^^^^^^ • 

andfimctional legume genomes. Total ^7^J^;f^f^^°^^^ as a model spcciesfor leguiBE. 
wasamulti4nvestigator.multi-««ttmt.onal^tto^^^^^^ 

biology and combined '^^^^^^^^^S^^^^^^^ »4«iy focused on the generatfan 

participated in the 8''"°™" P^'°™ developmentaBy-specifc 

^A^?S^'"lSy elicitor. arid oligpgal^oni* 

cDNA Ubranes. 5pecaicaiiy» we gcnciavcu ^iro^oA PCTq frnm the olicoelucoside elicitor library and 

generated a subtraction cDNA hbrary using iw. '^"J!'" ™ „„d„|ite A/ trancateto ecotypes: 5. meliloti m 
Lliloti. The two S. meliloti strains chosen J" J''^^^^ ti^tSl^i rSh produces only 

185 induces normaf elongated pink nodules on M ^'^'^'J'^^^ ^^ij^J'Z^^^^ truncZ 

reiLentedintheM. r«„car«/a EST collection ^^."^^f^^J,'^^^^^ 
■ trwcfltuto interaction, I^lbUcations are being prepared sumrnanztng the results M * . . 



16 



References 

1. H. Iwai, N. Masaoka, T. Ishii, and S. Satoh, A- pectin glucuronyltransferase gene is essential * 
for intercellular attachment in the plant meristem^ Pipc, Natl. Acad. Sci. USA 99:163119- 
16324(2002). 

.2. K.A. White, LA. Kaltashov, RJ. Cotter, and C.R.H. Raetz, A Mono-functional S-deoxyD- 
. manno-octulosonic acid (Kdo) transferase and a Kdo kinase in extracts of Haermphibos 
. influenzae, ThcJ.Bioh Chem. 272: 1 6555-1 6563 (1997). 

3. Z.-Y. Wang, R-Q. Zheng, G,-Z. Shenm. J.-P. Gao, D.P. Snustad, M.-G. Li, J.-L. ZBfian&and 

M.-H. Hong» The amylose content in rice endosperm is related to the post-transor^tioiiEal 
regulation* of the waxy gene, PlanUL 7:^ 

4. T. Clementz and C.R1H. Raetz, A gene coding for 3-deoxy-b-ma/irto-octulosonic-aicid 

transferase in Eschericial colL Identification, mapping, cloning, and sequencing, J. Biol-. 
. Chein, 266:9687-9696 (1991). ' 

5. D. Mohnen, Biosynthesis of pectins; in: "Pectins and their Manipulation", G.B. Seymouret al., 

Blackwell Publishing and CRC Press, Oxford, pp. 52-98 (2002). 

6. V. Walbot, Arabidopsis thaliana genome. A green chapter in the book of life, Natuig 408:794- 
•795(2000). 

7. R.L. Doong, K. Liljebjelke, G. Fralish, A. Kumar, and D. Mohnen, Cell free synthesis off 
. pectin: identification and partial characterization of polygalacturonate 4-a- 

galacturonosyltransferase and its products from membrane preparations of tobacco 
(Nicotiana tabacum L. cv san^sun) cell suspension cultures. Plant PhvsioL 109:141-152 
. (1995). 

.8. R.L. Doong and D. Mohnen, Solubilization and characterization, of a galacturonosyUransfierase 
that synthesizes the pectic polysaccharide homopalacturonan. Plant J. 13:363-374 (1998). 

9. H.V. Scheller, R.L. Doong, B.L. Ridley, and D. Mohnen, Pectin biosynthesis: a solubilizad 

galacturonosyltransferase from tobacco catalyzes the transfer of galacturonic acid from 
UDP-galacturonic acid onto the non-reducing end of homogalacmronan, Planta 207:512-517 
(1999). 

10. il. Sterling, H,F. Quigley, A. Orellana, and D. Mohnen, The.catalytic site of the pectin 

biosynthetic enzyme a-l,4-galacturonosyltransferase (GalAT) is located in the lunnen of the 
Gtolgj, Plant PhvsioL 127:360-371 (2001). . .. 
1.1, B.L. Ridley, M.A. O'Neill, and D. Mohnen, Pectins: structure, biosynthesis, and 
oligogalacturonide-related signaling, Phvtochemistrv 57:929-967 (2001). 

12. M.A. O'Neill, D* Warrenfeltz, K. Kates, P. Pellerin, T. Doco, A,G. Darvill, and P. 

Albersheim, Rhamnogalacturonan-E, a pectic polysaccharide in the walls of growing pliant 
cell, forms a dimer that is covalently cross-linked by a borate ester - in vitro conditions for 
the formation and hydrolysis of the dimer, J. Biol. Chem. 271:22923-22930 (1996). 

13. E. Zablackis, W.S. York, M. Pauly, S. Hantus, W.-D. Relter, CCS. Chappie, P. Albersheim, • 

and A. Darvill, Substitution of L-fucose by L-galactose in cell walls of arabidopsis mio-l. 
Science 272:1808-1810 (1996). 

14. M. O'Neill, P. Albersheim, and A. Darvill, The pectic polysaccharides of primary cell walls, 

in: "Methods in Plant Biochemistry, Volume 2", P.M.'Dey., ed., Academic Press, London, 
pp. 415-441 (1990). * . . 



17 



15. W.S. York, A.G. Darvill, M. McNeil, T.T. Stevenson, and P. Albersheim, Isolation and 
characterization of plant cell walls and cell wall components. Methods Enzvmol. 1^8:3-40 

* (1985). 

16. F. Goubet and D. Mohnen, Subcellular localization and topology of homogalacturonan 
methyltransferase in suspension-cultured Nicotiano tabacum cells, Planta 209: 1 12-117 
(1999). 

17. F. Goubet and D. Mohheh, Solubilization and partial characterization of homogalacteronanr 
methyltransferase from microsomal* membranes of suspension-cultured tobacco celis^ Plamt 
PhisioL 121:281-290 (1999). 

18. D. Mohnen, Biosynthesis of pectins and galactomahnans, in: "Comprehensive Natural 
Products Chemistry, Vol. 3, Carbohydrates and Their Derivatives including Tannins,. 
Cellulose, and Related Lignins", B.M. Pinto., ed., Elsevier, Oxford, pp. 497-527 (1999). 

19. K. Liljebjeike, R. Adolphson, K. Baker, R.L. Doong, and D. Mohnen, Enzyrn?itic synthesis 
and purification of uridine diphosphate [^"^CJgalacturonic acid: a substrate for pectin 
biosynthesis. Anal. Biochem. 22St296-304 (1995), . ^ 

20. M.A. O'Neill, S. Eberhard, P. Albersheim, and A.G. Darvill, Requirement of borate cross- 
linking of cell wall rhamnogalacturonaii II for Arabidopsis growth. Science 846-849 (200]). 

21. M. Pauly, Q. Qin, H. Ghreene. P. Albersheim^ A. Darvill, and W.S. York, Changes in tbe 
structure of xyloglucan during cell elongation, Planta 212:842-850 (2001 ). 

22. D. Mohnen and M.G. Hahn, Cell wall carbohydrates as signals in plants> Sena, Cell Biol. 
4:93-102(1993). 

23. S- Eberhard, N. Doubravai V. Marf^ D. Mohnen, A. Southwick, A. Darvill, arid P. 
Albersheim, Pectic cell wall fragments regulate tobacco thin-celHiayer explant 
morphogenesis. Plant Cell 1:747-^755 (1989). . . 

24. V. Maifa, D.J. Gollin, S. Eberhard, D. Mohnen, A. Darvill, and P. Albersheim, 

• Oligogalacturonides are able to induce flowers to form on tobacco explants. Plant J. 1:217- * 
225(1991). 

25. M.G. liahn, A.G. Darvill, and P. Albersheim, Host-pathogen interactions XK. The 

. endogenous elicitor, a fragment of a plant cell wall polysaccharide that elicits phytoalexin 
accumulation in soybeans^ Plant Physiol. 68:1161-1169 fl981). 

26. F. Cervone, M,G. Hahn, G. De Lorenzo, A. Darvill, and P. Albersheim, Host-pathogep 
interactions XXXHL A plant protein converts a fungal pathogenesiis factor into an elitiltor of 
plant defense responses. Plant PhvsioL 90:542-548 (1989). 

27. G. De Lx)ren20, K Cervone^ M.G. Hahn, A, Darvill, and P. Albersheim, Bacterial endopectate 
lyase: evidence that plant cell wall pH prevents tissue maceration and increases the half-life 
of elicitor^active oligopalacturonides. PhvsioL Mol. Plant Pathol. 39:335-344 (1991). 

28; M. Bar-Peled, C.L. Griffith, and T.L. Doering, Functional cloning and characterization of a 
UDP-glucuronic acid decarboxylase: the pathogenic fungus Cryptococcus neoformans 
elucidates UDP-xvlose synthesis. Proc. Natl. Acad. Sci.USA 98:12003-12008 (2001). 

29. S.U. Ahmed, M. Bar-Peled, and N.V. Raikhel, Clonging and subcellular location of an 

arabidopsis receptor-like protein that share conunon features with proteins- sorting receptors 
of eukaryotic cells. Plant Phvsiol. 1 14:325-336 (2000). 

30. M. Bar-Peled and N.V, Raikhel, Characterization of AtSEC12.and AtSARl - proteins likely 
involved in en$Ioplasmic reticulum and Golgi transport, Plant Phvsiol. 1 14:315-324 (2000). 



18 



31. R.M. Perrin, A.E. DeRocher, M. Bar-Peled, W. Zeng, L. Noramboena, A, Orellana, M.V. 
Raikhel, and K. Keegstra, Xyloglucan fucosyltransferase, an enzyme involved in planftcell 
wall biosynthesis. Science 284:1976-1979 (1999). 

32. S.U. Ahmed, M. Bar-Peled, and N.V. Raikhel, Cloning and subcellular location of an 
arabidopsis receptor-like protein that shares common features with protein-sorting reoeptois 
of eukaryotic cells. Plant Phvsiol. 115:311-312 (1997). 

33. M. Bar-Peled and N.V. Raikhel, Characterization of ATSEC12 and ATSARI-proteinslSkdy 
involved in the endoplasmic-reticulum and Golgi transport. Plant PhvsioL 1 14:3 15-324 
(1997). - 

34. M. Bar-Peled and N.Y. Raikhel, An efficient method for cloning in-frame fusion proteia 

genes. Anal. Biochem. 250:262-264 (1997). 

35. M. Bar-Peled, D.C. Bassham, and N.V. Raildiel, Traiisport of proteins in eukaryotic cdlC- 
' more questions aheaiL Plant Mol. Biol. 32:223-249 (1996). 

36. M. Bar-Peled, A. da Silva Conceicao, L. Frigerio, and N.V. Raikhel, Expression and 
regulation of aERD2, a. gene encoding the KDEL receptor homolog in plants, and othor 
genes" encoding proteins involved in ER-Golgi vesicular trafficking. Plant Cell 7:667-676 
(1995). ' . ' 

37- M. Bar-Peled and N.V. Raikhel, A . method for isolation and purification of specific 

antibodies to a protein fused in the GST. Anal. Biochem. 241:140-142 (1996). 
38. R. Balestrini, M.G. Hahn, A.. Faccio, K. Mendgen, and P. Bonfante, Differential localization 

of carbohydrate epitopes in plant cell walls in the presence and absence 6f arbuscul^r 

mycoirhizal fungi. Plant Phvsiol. 111:203-213 (1996). 
39- G. Freshour, R.P. Clay, M*S. Fuller, P. Albersheim, A.G. Darvill, and M.G. Hahn, 

Developmental and tissue-specific structural alterations of the cell-wall polysaccharides of 

Arabichpsis thaliana Roots, Plant PhvsioL 110:1413-1429 (1996): 

40. MJSf.V. Williams, G. Freshour, A.G. Darvill, P. Albersheijn, and.M.G. Hahn, An antibody 
Fab selected from a recombinant phage display library detects deesterified pectic 
polysaccharide rhamnogalacturonan IT in plant cells. Plant Cell 8:673-685 (1996). 

41. G. Kossoy, H: Ben-Hur, I. Zusman, .and Z. Madar, Effects of a 15% orange-pulp diet on 
tumorigenesis and immune response in rats with colon tumors, Oncol Rep 8: 1387-1391 
(2001), 

42. 1. Eljaz, The potential role of modified citrus pectin in the prevention of cancer metastasis. 

Clinical Practice of Alternative Mediciiie 2: 1-7 (2001). 
43. D. Piatt and A. Raz, Modulation of the lung colonization of B16-F1 melanoma cells by dtras " 

pectin. J. Natl. Cancer Inst. 84:438-442 (1992): 
44.. K J. Pienta, H. Naik, A. Akhtar, K. Yamazaki. T.S. Replogle. Ji Lehr. T.L. Donat, L. Tait, V, 

Hogan, and A. Raz, Inhibition of spontaneous metastasis in a rat prostate cancer model by 

oral administration of modified citrus pectin, J. Natl. Cancer Inst 87:348-353 (1995). 

45. J.F. Pirani, The effects of phytotherapeutic agents on prostate cancer: an overview of recent 
cUnical trials of PC SPES, Uroloev 58:36-38 (2001). 

46. H.A: Schols, M.A. Posthumus, and A.GJ. Voragen, Stractural features of hairy regions of 
pectins isolated from apple juice produced by the liquefaction process, Carbohvdr. Res. 
206:117-129(1990). . • . ' ' . , 

47. H.A. Schols, E-J.. Bakx, D. Schipper, and A.G.J. Voragen, A xylogalacturonan subunit 
present in the. modified hairy regions of apple pectin, Carbohvdr. Res. 279:265-279.(1995). 



19 



48. L. Yu and A.J. Mort. Partial characterization of xylogalacturonajns from cell walls of i^e 
watermelon fruit: Inhibition of endopolygalacturonase activity by xylosylation, in: "Pbctins 
and Pectinases", J. Visser et al., Elsevier. Amsterdam^ pp. 79-88 (1996). 

49. D.A. Hart and P.K. Kindel, Isolation and partial characterization of apiogalacturonans feom 
the cell wall of Lemna minor, Biochem, J. 1 16:569-579 (1970). 

50. R.R. Watson and N.S. Orenstein, Chemistry and biodsemistiy of apiose. Adv. Carboh^, 
Chem, Biochem. 31:135-184 (1975). 

51. A. Fleischer, M.A. ONeU, and R. Ehwald, The pore size of non-graminacebus plant cdB 
* walls is rapidly decreased by borate ester cross-linking of the pectic polysaccharide 

ihamnogalacturonan H, Plant PhvsioL 121 :829-838 (1999). 

52. J.-C. MoUet, S.-Y. Park, E.A. Nothnagel, and E.M. Lord, Alily .stylar pectin is necessaiy for 
pollen tube adhesion to an in vitro stylar matrix. Plant Cell 12:1737-1749 (2000). 

53- T.L. Western, J. Bum, WX. Tan, D. J. Skinner, L. Martin-McCaffrey, B.A. Moffatt, and 
' G.W. Haughn, Isolation and characterization of mutants defective in seed coat nvucilags 
secretory cell development in Arabidopsis. Plant PhvsioL 127:998-101 1 (2001). 

54. W.G.T. Willats, L. McCartney, and J.P. Knox, In-situ analysis of pectic polysaccharides i^i 
seed mucilage and at the root surface of Arabidopsis tkaliana, Pljanta 2 1 3 :37-44 (200 1 )1 * 

55. Z-H. GonzSlez-Carranza, CA. Whitelaw, R. Swanip, and JA. Roberts, Temporal and spat^ 
expression of a polygalacturonase during leaf and flower atecission in- oilseed rape and 
Arabidopsis, Plant PhvsioL 128:534-543 (2002). • 

56. M.A. Zwieniecki, P.J. Melcher, and N.M. Holbrook, Hydrogel control of xylem hydraulic ' 
resistance in plants, Ss^ce 291:1059-1062 (2001). 

57. F. C6t6, K,-S, Ham, M.G.Hahn, and G:W. Bergmann, Oligosaccharide elicitors in host- 

. pathogen interactions generation, perception, and signal transduction, in: "Plant-Microbe 
hiteractipns", B.B, Biswas et al.. Plenum Press, New York, pp. 385-432 (1998). 

58. S. Fry, Aldington, P: Hetherington, and J. Aitken, Oligosaccharides as Signals and 
Substrates in the Plant Cell Wall, Plant PhvsioL 103: 1-5 (1993). 

59. N. Shibuya and E. Minami, Oligosaccharide signalling for defence responses in plant, 
Phvsiol. MoL Plant Pathol. 59:223r233 (200l). 

60. R.G. Atkinson, R. Schrfider, IC- Hallett, D. Cohen, and E.A. MacRac, Qverexpression of . 
polygalacturonase in transgenic apple trees leads to a range of novel phenotypes involving 
changes in cell adhesion. Plant PhvsioL 129:122-133 (2002), 

61. M. Skjot, M. Pauly, M.S. Bush, p. B.orkhardt, M.C. McCann, and R Ulvskov, Direct 

interference with rhamnogalacturonan I biosynthesis in Golgi vesicles. Plant PhvsioL 

* 129:95-102(2002). 

62. S. Bouton, E. Leboeuf, and G. Mouille, QUASIMODOl encodes a putative membrane-bound . 

* glycosyltransferase required for normal pectin synthesis and cell adhesion in Arabidopsis, 
Plant Cell 14:2577-2590 (2002). 

63. D. Mphnen, R.K Doong, K. liljebjelke, G. Fralish, and J. Chan, Cell free synthesis of the 

pectic polysaccharide homogalacturonan; in: "Pectins and Pectinases"-, J. Visser et al., 
Elsevier Science B.V, Amsterdam, pp: 109-126 (1996).. 

64. N.C. Cajrpita and D;M. Gibeaut, Structural models.of primaiy cell walls in flowering plants: 

consistency of molecular stracture with the physical properties of the walls during growtb; 
HantL.3: 1-30 (1993). 

j * 

20 



65. - j'. An, M. A. O'Neill, P. Albersheim, and A.G: Darvill, Isolation and structural 

characterization of p-E>-glucosyIufonic acid and 4-O-inethyl P-D-glucosyluronic acid- 
containing oligosaccharides from the cell-wall pectic polysaccharide, rhamnogalactunn^ ^ 
Carbohvdn Res.. 252,-235^243 (1994). • ' ' 

66. J. An, L. Zhang, M.A. O'Neill, P, Albersheim, and A.G. Darvill, Isolaton and structural 
characterization of endo-Aamnogalacturonase-generated fragments of the backbone of * 
rharhnogalacturonan L Carbohvdr. Res. 264:83-96 (1994;^- 

67. W.G.T. Willats, L. McCartney, W. Mackie^ and LP. Knox, Pectin: cell biology and prppects 
for functional analysis. Plant Mol. Biol. 47:9-27 (2001). 

68. A. Kikuchi, Y. Edashige, T. Ishii, and S. Satoh, A xylogalacturonan whose level is depcffirientt 
on the size of cell clusters is present in the pectin from cultured carrot cells, Plahta 200:369- 
372(1996). . 

69^ G.O. Aspinall, Chemistry of cell wall polysaccharides, in: "The Biochemistry of Plants, 
VoL3.", J. Preiss., ed.. Academic Pressi New York, pp. 473-500 (1980). 

70, E.M. Lord and J.-C, Mollet, Plant cell adhesion: a bioassay facilitates discovery of the finst 
pectin biosynthetic gene, Proc. Natl. Acad. Sci. USA 99:15843-15845 (2002). 

71. D.H, Northcote, The Golgi apparatus. Endeavor 30:26:33 (1971). 

'72: D.H. Northcote and SJD. Pickett-Heaps, A function of the Golgi Apparatus in polysaccharide 
synthesis and transport in the root-cap cells of wheat. Biochem. J. 98:159-167 (J 966). 

73. P.J. Harris and D.H. Northcote, Polysaccharide fomiation in plant golgi bodies. Biochim. 
Biophvs. Acta 237:56-64 n97n. 

74. R.W. Stoddart and D.H. Northcote, Metabolic relationships of the isolated fractidns of the 
pectic substances of actively growing sycamore cells. Biochem. J. 105:45-59 (1967). 

75. G.F. Zhang and L.A. Staehelin, Functional compartmentation of the golgi apparatus of pl;2nt 
cells; Xoimunocytocheniical analysis of high-pressure frozen* and freeze-substituted 
sycamore maple suspension culture cells. Plant Physiol, 99:1070-1083 (1992). 

76. L.A. Staehelin and L Mooie, The plant Golgi apparatus: structure, functional organization 
and trafficking mechanisms, Annu. Rev. Plant Phvsiol. Plant Mol. Biol. 46:261-288 (1995). 

77. M.E. Edwards, C.A. Dickson, S. Chengappa, C. Sidebottom, M.J. Gidley, and J.S.G. Reidt, 
Molecular characterisation of a membrane-bound galactosyltransferase'of plant cell wall 
matrix polysaccharide biosynthesis. Plant J. 1 9:69 1 -697 (1 999). 

78. A. Faik, N. J. Price, N.V. Raikhel, and K. Keegstra, An Arahidopsis gene encoding an a- 
xylosyltransferase involved in xyloglucan biosynthesis. Ptoc. Natl. Acad, Sci, USA 99:7797- 
7802(2002). 

79. D.P. Delmer, A hot mutant for cellulose synthesis. Trends in Plant Science 3 : 1 64( 1 998). 

80. C.L. Villemez, A.L. S wahson, and W.Z. Hassid, Properties of a polygalacturonic acid- 
synthesizing enzyme system from Phaseolus aureus seedlings. Arch. Biochem. Biophys. 
116:446-452(1966). 

81. H. Kauss and A.L. Swanson, Cooperation of enzymes responsible for polymerization and 
methylation in pectin biosynthesis. Z\ Naturforsch. 24:28-33 (J 969). 

82. T.-Y. Lin, A J). Elbein, and J.C. Su, Substrate specificity in pectin synthesis. Biochem. 
Biophvs. Res. Commun. 22:650>657 (1Q66). 

83. G.P. Bolwell, G. Dalessandro, and D.H. Northcote, Decrease of polygalacturonic acid 
synthase during xylem differentiation in sycamore. Phvtochemistrv 24:699-702 (1 985): 



21 



84. Y. Takeuchi and Y. Tsumuraya, In vitro biosynthesis of homogalacturonan by a menfibiaine- 
bound galacturonosyltransferase from epicotyls of aznki bean, Biosci. Biotech. BiochcmtL 
65:1519-1527(2001). 

85. K. Akita, T. Ishinuzu, T, Tsukiainoto, T. Ando, and S. Hase, Successive glyco§yltransfcr 
activity and enzymatic characterization of pectic polygalacturonate 4-a- 
galacturonosyltransferase solubilized from pollen tubes of Petunia axillaris using 
pyridylaminated oligogalacturonates as substrates. Plant Physiol, 130:374-379 (2002), 

86. A- Orellana and D. Mohnen, Enzymatic S3mthesis and purification of uridine 
diphosphate galacturonic acid for use in studying Golgi-localized transporters, AnalvticaiE 
Biochemistrv 272:224-231 (1999). 

87. J-A, Lemons, LF. Forkner, J. Sterling, and D. Mohnen, Development of a filter assay for 

measuring homogalacturonan: al,4-galacturonosyltransferasc activity, in preparation. 

88. S. Pagfes, H.C.M. Kester, J. Visser, and J.A-E. Benen, Changing a single amino acid resi^ 
switches processive and non-processive behavior of Aspergillus niger ' 
#>nrfripn1yga1actnrona.se T and IL J, Biol. Chem. 276:33652-33656 (2001), 

89. R. A.F. Reithmeier and CM. Deber, Intrinsic membrane protein structure: principles and 
prediction, in: "the Structure of Biological Membra^les^ P. Yeagle., ed., CRC Press,. Bcaa 
Raton, pp. 337-393.(1992). 

90. J.A, Campbell, GJ. Davies, V. Bulone, and B. Henrissal, A classification of nucleotide- 
diphospho-sugar glycosyltransferases based on amino add sequence similarities, Bioch^JL 
326:929-942(1997). 

91. B. Heyirissat and GJ. Davies, Glycoside. hydrolases aijd glycosyltransferases. Faixiilies, 

modules, and implications for genomics. Plant Phvsiol 124:1515-1519 (2000). 

92. R. Sarria, T.A, Wagner, MA, OlSfeill, A. Faik, C.G. Wilkerson, K. Keegstra, and N.V. 
Raikhel, Characterization, of a family of Arabidopsis genes related to xyloglucan r 
fncnsyltransfera^el. Plant Phvsiol. 127:1 595>- 1606 (2001). 

93. S.J. Chamock, B, Henrissat, and G.j. Davies, Three-dimensional structures of UDP-sugar 
glycosyltransferases illuminiate the biosynthesis of plant polysaccharides. Plant Phvsiol. 
125:527-531 (2001). 

94. C. Whitfield and M.A. Valvano, Biosynthesis and expression of cell-surface polysaccharide 

in gram^nftgftHve bacteria. Adv. Microb. Phvsiol, 35:135-246 (1993). 

95. M. Regu6. N. Climent, N. Abitiu, N. Coderch, S. Merino, L. Izquierdo, M. Altarriba, and 

J.M. Tom5s, Genetic characterization of the Klebsiella pneumoniae waa gene cluster, 
involved in core lipopolysaccharide biosynthesis, J. Bacteriol. 183:3564-3573 (2001). 

96. J. Shang, R. Qiu, J. Wang, J. Uu, R. Zhou, tt. Ding. S. Yang, S. Zhang, and C- Jjn,. 
Molecular cloning and expresssion of Gall ,3GalNAc 23-sialyltransferase from human Ifetal ' 
Hven Eur. J. Biochem. 265:580-588 (1999). 

97. W. Gillespie, S. Kelm, and J.C. Paulson, Cloning and expression of the Gal beta l,3GalNAc 
. alpha 9.^^5:i'aly1transferase, J. Biol. Chem. 267:21004-21010 (1992), 

98. BJD. Livingstom and J,C. Paulson, Polymerase chain reaction cloning of a developmentally 

regulated member of the sialyltransferase gene family. J. Biol. Chem. 265:580-588 (1993). 

99. D. Mohnen, H. Shinshi. G. Felix, and FJ. Meins, Hormonal regulation of pl,3-glucanase . 

messenger RNA levels in cultured tobacco tissues. gNDiOJL 4:1631-1635 (1985). 



22 



100. H. Shinshi, P, Mohnen, andF. Meins, Jr. Regulation of a plant pathogenesis-related 
enzyme: Inhibition of chitinase and chitinase mRNA accumulation in cultured tobacco 
tissues by auxin and cytokinin, Proc. NatL Acad. Sci. USA 84:89-93 ( 1 987). 

101. CM. Cabral, Y. Liu, K-W. Moremen, and R.N. Sifers, Organizational diversity among 
distinct glycoprotein endoplasmic reticulum-associated degradation programs^ MoL BioL 
Cell 13:2639-2650 (2002). 

102. A.C- Porchia and H.V. Scheller» Arabinoxylan biosynthesis: identification and partial 
characterization of beta-l,4-xylosytransferase from wheat, physiological olantanim 1 10:S50- 
356 (2000),(Abstract) 

103. J.F. Kukowska-Latallo, R.D. Larsen, R.P. Nair, and J.B. Lowe, A cloned human cDNA 
determines expression of a mouse stage-specific embryonic antigen and the Lewis blood 
group aIpha(l,3/l,4)fucosyltransferase, Genes Dev. 4:1288-1303 (1990). 

104. M. Mallisard, S. Zeng, andE:G. Berger, Expression of functional soluble forms of humasB 
• -l,4-galactosyltransferaseI» * *2,6-sialyltransferase, and • -2,6-sialyltransferase, and • - 

• ■ 1 ,3-fncosyltransferase. VI in the methylotrophic yoBsiPichia pastoris , Biochem. BiophvsL 
Res. Commun. 267:169-173 (2000\ 

105. iP.A. Romero, M. Lussier, AM. Sdicu, H. Bussey, and A. Herscovics, KTRIP is an alpha- 
1 ,2-mannosyItransferase of Saccharomyces cerevisiae comparison of the ^n23dnatic 
properties of soluble recombinant KTRIP. and KRE2P/MNT1P produced in Pichia pdstowis^ 
Biochem. J. 321:289-295 (1997^ . . 

106. U. Hochstrasser, M. Luscher, C. DeVirgilio, T. Boiler, and A. Wiemken, Expression of a 

, functional barley sucrose-fiructan 6-fractosyl transferase in the methylotrophic yeast Pichia 
pastoris. Febs Letters 440:356-360 f 1998^. 
liQ7. P.R Gallet, H, Vaujour, J.-M. Petit, A. Maftah, A. Oulmouden, R. Oriol, G. Le Narvori ML 
Guilloton, and R. Julien, Heterologous expression of an engineered truncated form of human 
Lewis fucdsyltransferase (Fuc-THO by the methylotrophic yeast Pichia pastoris, 
Glvcobiology 8:919-925 (19981 

108. ^K.W. Moreman and O. Touster, Topology of mannosidasw II in rat liver membrane and 
. release of the caalytic domain by selective proteolysis, JrBioL Chem 23: 10945-10951 

(1986). 

109. R.K. Merkle, Y. Zhang, P.L Ruest, Y.I?. Liao, and K.W. Moremen. Cloning, expression. \ 
purification, and charactarization.of the murine lysosonial acid alpha-mannosidase, Biochiing 
BioDhvs. Acta 1336:132-146(19971 

110. Y.F. Liao, A. Lai, and K.W. Moremen, Cloning, expression,.purification, and 
characterization of the human broad-specificity lysosomal acid alpha-mannosidase, J. BioL 
QienL 271:28348-28358 (1996). 

111. H.F. Quigley, D.L. Howard, J. Sterling, and D. Mohnen, A non-radioactive gel 
electrophoresis assay for homogalacturonan • -1,4-galacturonosyltransferase, Anal. 
BiochenL (2002). 

1 12. A. Faik, M. Bar^Peled. A, DeRocher, W. Zeng. R.M. Peirin, C \Vilkinson, N, Raikhel, and 
K. Keegstra, Biochemical characterization and molecular cloning of an • -1,2- 
fucosyltransferase that catalyzes the last step of cell wall xyloglucan biosynthesis in pea. The 
Journal of Biological Chemistry 275:15082-15089 (2000). 



23 



1 13. M. Bar-Peled, R. Huhr, and J. Gressel, Juvenile-specific localization and accumulation off a 
rhamnosyltransferase artd its bitter flavonoid in foliage, flowers, and young citrus fruits. 
Plant PhvsioL 103:1377-1384 (1993). 

1 14. M, Bar-Peled, E. Lewinsohn, R. Huhr, and J. Gressel, UDP-rhamnose:flavanone-7-0- 
glucoside-2"-0-rhamnosyltransferase. Purification and characterization of an enzyme 
catalyzing the production of bitter compounds in citrus. J. Biol, C3iem, 266:20953-20959 

. (1991). 

115. F. Goubet, L.N. Council, andD. Mohnen, Identification aiid partial characterization of the 
pectin methyltf ansferase "homogalacturonan-methyltransferase" from men^branes of tobaoco 
cell suspensions. Plant Physiol 116:337-347 (1998). 

116: A.D. Harper and M. Bar-Peled, Biosynthesis of UDP-Xylose. Clbning and characterizatice 
of a novel Arabidopsis gene family, XJXS^ encoding solubIj& and putative membrane-bound 
■ UDP-Glucuronic acid decarboxylase isoforms. Plant PhvsioL 130:2188-2198 (2002), 

1 17. S-S. Basu, G.D. Dotson, and C.R.H! Raetz, A facile enzymatic synthesis of uridine 
Hiphn5;phn-[14r!]galactiironic acid. Anal. Biochem. 280:173-177 (2000). 

118. S. Vidal, T. Ddco, P. Williams, P. Pellerin, W.S. York, MA. ONeill. J. Glushka, A.G. 
Darvill, aind P. Albersheim, Structural characterization of the pectic polysaccharide 
rhamhogalacmronap 11: evidence for the backbone location of the aceric acid-containing, 
oligoglycosyl side chain. Caifaohvdr. Res. 326:277-294 (2000). 

119. AJ. Whitcombe, M.A. CNeBI, W. Steffan, P. Albersheim, and A.G. D^ill, Structural 
characterization of the pectic polysacchride, Rhamnogalacturonan-II, Carbohvdr. Res, 
271:15-29 (1995)^ 

120. D.C. Boyes, A.M. Zayed, R. Ascenzi, A.J. McCaskill, .N.E- Hoffman, K.R. Davis, and J. 
Gorlach, Growth stage-based phenotypic analysis of arabidopsis: a model for high 
throughput functional genomics in plants^ Plant Cell 13:1499-1510 (2001). 

121 . E. Zablackis, J. Huang, B. MuUer, A-G. Darvill, and iP. Albersheim, Characterization of the 
cell-wall polysaccharides of Arabidopsis thaliana leaves. Plant PhvsioL 107:1 129-1 138 
(1995). 

122. J.P. Briand, C. Barin, MJI. van Regenmortel, and S. Mullef, Application and limitations of 
- ' the multiple antigen peptide (MAP) system" in the production and evaluation of anti-peptide 

and anti-protein antibodies. J. Immunol. Methods 156:255-265 (1992). 

123. V: Ziehl, R.P. Clay, C. Bergmann, A. Darvill, and P. Albersheim, Polyclonal antibodies 
directed against synthetic N-terminus of fungal endopolygalactoronase recognizes the native 
protein and cross-reacts with endopolygalacturonase of a.different funal species, (2002). 
(UiiPub) 

124. G. DiLaurenzio. J.- Wysocka-Diller, J.E. Malamy, L, Pysh, Y, Helariutta, G. Freshour, M.G. 
Hahn, K.A'. Feldmann, and PJI. Benfey, The SCARECROW gene regulates an asymmetriQ 

• Cell division that is essential for generating the rs^dial organization of the Arabidopsis root. 
Cell 86:423-433 (1996). ' 

125. M.G. Hahn, G. Freshour, W.-D. Reiter, P. Albersheim, toad A-O- Darvill, 
Immunolocalization of fucose-containing polysacchairides in cell walls of the murl mutant of 
Arabidopsis thaliana. Plant Physiol (2002) (in press). * ; - ' 

126. K. Enkerli, M.G. Hahn, and C.W. Mims, Ultrastructure of compatible aiid incompatible 
interactions of soybean roots infected with the plant pathogenic comycete Pbytophthora 
sojae. Can. J. Bot. 75:1493-1508 (2002). 



•24' 



127. K. Enkerli, M.G. Hahn, and C.W. Mims, Immunogold Icxralization.of callose and other plant 
cell wall components in soybean roots infected with the plant padiogehic oomycete 
Phvtophthora sojae. Can. J. Bot 75:1509-1517 f 1997). 

128. J.D. Thompson, T.J. Gibson, R Plewniak, F. Jeanmougin, and D.G. Higgins, The ClustalX 
windows intelrface: flexible strategies for multiple sequence alignment aided by quality 
analysis tools. Nucleic Acids Res. 24:4876-4882 (1997) 



25 



The amino acids which occur in the varions amino add sequences refeired to in the 
specification have their usual three- and one-letter abbreviations routinely used in the art: A, Ala, 
Alanine; C, Cys, Cysteine; D, Asp, Aspartic Add; E, Glu, Glutamic Add; F, Phe, Phraylalanine; 
G, Gly, Glycine; H, His, Histidine; I, He, Isoleudne; K, Lys, Lysine; L, Leu, Leucine; M, Met, 
Methionine; Asn, Asparag^ne; P, Pro, Proline; Q, Gbi, Giutamine; R, Arg, Aiginine; S, Ser, 
Serine; T, Thr, Threonine; V, Val, Valine; W, Try, Tryptophan; Y, Tyr, Tyrosine. 

A protein is considered an isolated protein if it is a protdn isolated from a host cell in 
which it is recombinantly produced. It can be purified or it can simply be free of other proteins 
and biological materials with which it is associated in nature. 

An isolated nucleic add is a nucldc add the structure of which is not identical to that of 
any naturally occurring nucleic acid or to that of any fragment of a naturally occurring genomic 
nucleic add spanning more than three separate genes. The term therefore covers, for example, 
(a) a DNA which has the sequence of part of a naturally occurring genomic DNA molecule but is 
not flanked by both of the coding or noncoding sequences that flank that part of the molecule in 
the genome of the organibm in which it naturally occurs; (b) a nucleic add incorporated into a. 
vector or into the genomic DNA of a prokaryote or eukaryote in a maimer such diat the resulting 
molectde is not identical to any naturally occurring vector or genomic DNA; (c) a separate 
molecule such as a cDNA, a genornic fragment, a fragment produced by polymerase chain 
reaction (PGR), or a restriction fragment; and (d) a recombinant nucleotide sequence that is part 
of a hybrid gene, i.e., a gene encoding a fusion protero.' Sp^ecifically excluded from this 
definition are nucleic adds present in mixtures of (i) DNA molecules, (ii) transformed or 
transfected cells, and (iii) cell clones, e.g., as these occur in a DNA library sudi as a cDNA or 
genomic DNA library. 

As used herein expression directeSl by a particular sequence is the transcription of an 
assodated downstream sequence. If appropriate and desired for the associated sequence, there 
the term ^pression also encompasses translation (protein synthesis) of the transcribed RNA. • 
When expression of a sequence of interest is 'Sip-regulated," the expression is increased. 

In the present context, a promoter is a DNA region which includes sequences sufficient to 
cause transcription of an associated (downstream) sequence. The promoter maybe regulated, 
i.e., not constitutively acting to cause transcription of the assodated sequence. If indudble, there 
are sequences present which mediate regulation of expression so that the associated sequence is 
transcribed only when an inducer molecule is present in the medium in or on which the organism 
is cultivated. In the present context, a transcription regulatory sequence includes a pitomoter 
sequence and the cis-active sequences necessary for regulated expression of .an associated 
sequence in response to enviromnental signals. 

One DNA portion or sequence is downstream of second DNA portion or sequence when 
it is located 3* of the second sequence. One DNA portion or sequence is upstream of a second 
DNA portion or sequence when it is located S' of that sequence. 
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One DNA molecule or sequence and another are heterologous to another if the two are 
not derived from the same ultimate natural source. The sequences may be natural sequences, or 
at least one sequence can be designed by man, as in the case of a multiple cloning site region. 
The two sequences can be derived from two different species or one sequence can be produced . 
by chemical synthesis provided that the nucleotide sequence of the synthesized portion Was not 
derived from the same organism as the other sequence. 

An isolated or substantially pure nucleic acid molecule or polyaucleotide is a- 
polynucleotide which is substantially separated from other polynucleotide sequences which 
naturally accompany a native GALATl transcription regulatory sequence. The term embraces a 
polynucleotide sequence which has been removed from its naturally occurring environment, and 
includes recombinant or cloned DNA isolates, chemically synthesized analogues and analogues 
biologically synthesized by heterologous systems, 

A polynucleotide is said to encpde a polypeptide if, in its native state or when 
manipulated by methods known to those skilled in the art, it can be transcribed and/or translated 
to produce the polypeptide or a fragment thereof. The anti-sense strand of such a polynucleotide 
' is also said to encode the sequence. 

A nucleotide sequence is operably linked whai it is placed into a fimctional relationship 
with another nucleotide sequence. For instance, a promoter is operably linked to a coding 
sequence if the promoter effects its transcription or expression. Generally, operably linked 
means that the sequences being linked are contiguous and, where necessary to join two prot^ 
coding regions, contiguous and in reading frame. However, it is well known that certain genetic 
elements, such as enhancers, may be operably linked even at a distance, i.e., eveil if not 
contiguous. 

The terai recombinant polynucleotide refers to a polynucleotide which is made by the 
combination of two otherwise separated segments of sequence accomplished by the artificial 
manipvaption of isolated segments of polynucleotides by genetic engineering techniques or by . 
chemical synthesis. Jxi sp doing one may join together polynucleotide segments of desired 
fimctioiis to .generate a desired combination of £m 

Polynucleotide probes include an isolated polynucleotide attached to a label or reporter . 
'molecule and may be used to identify and isolate other galatl transcription regulatory sequences, 
for example, those from other species ofAureoba^idium or other strains of A. pullulans. Probes 
comprisiag synthetic oligonucleotides or other polynucleotides may be derived from naturally 
* occurring or recombinant single or double stranded nucleic acids or be chemically synthesized. 
Polynucleotide probes maybe labeled by any of the methods known in the art, e.g., random 
hexamer labeling, nick translation, or the Klenow fill-in reaction. 

Large amounts of the polynucleotides may be produced by replication in a suitable host 
cell. .Natural or synthetic DNA fragments coding for a protein of interest are incorporated into 
recombinant polynucleotide constructs, typically DNA constructs, capable of introduction into 
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and repiication in a prokaryotic or eukaryotic cell, Arabidopsis thaliana, wherein protein 
. expression is desired. In addition to the Arabidopsis thaliana specifically exemplified herein, 
others can be used, including but not limited to. A, pullulcms Y-1 17 [Li et al. (1997) Fungal 
Genetics Newsletter 44:29-32]; Apullulans R106 [Thonrewell et al. (1995) Gene 162:105-1 10]; 
A. pullulans NRRL Y-6220, Y-6754a, Y-12,974, YB-4026 and YB-4588 [Leathers et al. (1988)' 
j: Indus. Microbiol 3:231-239]; and A pullulans CBS 58475 [Dobberstein and Emeis (1989) 
Appl Microbiol Biotechnol 32:262-268]. Usually the construct is suitable for rq)lication in a 
unicellular host, such as A. pullulans or a bacteritim, but a multicenular Eukaryotic host may also 
be appropriate, with or without integration within the gmome of the host cell. Commonly used 
prokaryotic hosts include strains of Escherichia coli, although other prokaiyotes, such as Bacillus 
subtilis or a pseudomonad, may also be used. Eukaryotic host cells include yeast, filamentous . 
fimgi, plant, insect, amphibian and avian species, but the regulated expression of a protein of 
interest a cell of the genus Aureobasidium, especially -4. pullulans, is preferred. Such factors as 
ease of manipulation, ability to appropriately glycosylate expressed proteins, degree and control* 
of protein expression, ease of purification of expressed proteins &way firom cellular contanunants 
or other &ctors influence the choice ofthe host cell. . 

The polynucleotides may also* be produced by chemical synthesis, e.g., by the ' 
phosphoramidite method described by Beaucage and Caruthers (1 98 1) Tetra. Letts. , 22: 1 859- 
1862 or the triester method according to Matteuci et al, (1981) Jl Am. Chem. Soc, 103: 3 185, and . . 
may be performed on commercial automated oligonucleotide synthesizers. A double-stranded 
fragment may be obtained fix>in the single stranded product of chemical synthesis either by 
synthesizing the complementary strand and annealing the strand together under appropriate . 
conditions .or by adding the compleqientary strand uising DNA polymerase with an appropriate 
primer sequence. 

DNA constracts prepared for introduction into a prokaryotic or eukaryotic host will 
typically comprise a replication system (i.e. vector) recognized by the host, including the 
intended DNA firagment encoding the desired polypeptide, and will preferably also include 
transcription and translational initiation regulatory sequences operably linked to the polypeptide- 
encoding segment. Expression systems (expression vectors) may include, for example, an origin 

• of replication or autonomously replicating sequence (ARS) and expression control sequences, a • 
promoter, an enhancer and necessary processing information sites, such as ribosome-binding 
sites, RNA splice sites, polyadenylation sites, transcriptional terminator sequences, and mRNA 
stabilizing sequences. Signal peptides may also be included where appropriate firom secreted 

. polypeptides of the same or related species-, which allow the protein to cross and/or lodge in cell - 
membranes or be secreted firom the cell. 

An appropriate promoter and other necessary vector sequences will be selected so as to be 
functional in the host. Examples of workable combinations of cell lines and expression vectors 
are described in Sambrook et al. (1989) vide mfira; Ausubel et al. (Eds.) (1995) Current 
Protocols in Molecular Biology, Greene Publishing and WUey IntersciOTce, New York; and 
Metzger et al. (1988) Nature^ 334: 3 1-36. Many useful vectprs for expression in bacteria, yeast, 
fungal, miammalian, insect, plant of other cells are well known in the art and may.be obtained 
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such vendors as Stratagene, New pngland Biolabs, Promega Biotech, and others. In adfdition, the . 
construct may be joined to an amplifiable gene (e.g., DHFR) so that multiple copies of the gene 
may be made. For appropriate enhancer and other expression control sequences, see also 
Enhancers and Eukaryotic Gene Ejtpressiony Cold Spring Harbor Press, N.Y. (1983). While 
such expression vectors may repUcate autonomously, tihey may less preferably replicate by being 
inserted into the genome of the host cell. 

ET^ression and'ploning vectors will likely contain a selectable niarker, that is, a gene 
encoding a protein necessary for the survival or growth of a host cell transformed wifli the vector. 
Although such a marker gene may be carried on another polynucleotide sequence co-introduced 
into the host cell, it is most often contained on the cloiiing vector. Only those host cells into 
which the marker gene has been introduced will survive and/or grow under selective conditions. 
Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxic 
substances, e.g., ampicillin, neomycin, methotrexate, etc.; (b) complement auxotrophic 
deficiencies; or (c) supply critical nutrients not available from coniplex media. The choice of the 
proper selectable marker will depend on the host-cell; appropriate markers fpr different hosts are 
known in the art. 

Recombinant host cells, in the present context, are those which have been genetically 
modified to contain an isolated DNA molecule of the instant invention. The DNA can be 
introduced by any means known to the art which is appropriate for the particular type of cell, * 
including without limitation, transformation, lipofection ot electroporation. 

It is recognized by those skilled in the art that the DNA sequences may vary due to the 
degeneracy of the genetic code and codon usage. All DNA sequences which code for the 
GALATI signal peptide are included in this invention, including DNA sequences as given in 
herein having an ATG preceding the coding region for the mature protein, and including DNA 
sequences with and. without the intron identified in the Specification. 

Additionally, it will be recognized by those skilled in the art that allelic variations may 
occur in the DNA sequences which wiU.not significantly change activity of the amino acid 
sequences of the peptides which the DNA sequences encode.. All such equivalent DNA 
sequences are included within the scope of this iovention and the definition of the regulated 
promoter region. The skilled artisan will understand that the sequence of the exemplified 
promoter sequence and the nucleotide sequence encoding the signal peptide can be used to 
identify and isolate additional, noiiexemplified nucleotide sequences which are fimctionally 
equivalent to the sequences given in the Specification. 

Hybridization procedures are useful for identifying polynucleotides with sufSdent 
homology ( i.e,, homologs of GAJLATl in other plants) to the subject sequences to be usefiil as 
taught herein. The particular hybridization techniques is not essential to the subject invention.* 
As improvements are made in hybridization techniques, they can be readily applied by one of 
ordinary skill in the art ' 
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A probe and sample are combined in a bybridization buffer solution and held at an 
appropriate temperature until annealing occurs. Thereafter, the membrane is washed free of 
extraneous materials, leaving tiie sample and bound probe molecules typically detected and 
quantified by autoradiography and/or liquid scintillation counting. As is well known in the art, if 
the probe molecule and nucleic acid sample hybridize by forming a strong non-covalent bond 
between the two molecules, it can be reasonably assumed that the probe and sample are 
essentially identical, or completely complementary if the aimealing and washing steps are carried 
out under conditions of high stringency. The probe's detectable label provides a means for 
detennining whether hybridization has occurred. 

In the use of the oligonucleotides or polynucleotides as probes, the particular probe is 
labeled witfi any suitable label known to those skilled in the qrt, including radioactive and non- 
radioactive labels. Typical radioactive labels include ^^P,^^S, or the like. Non-radioactive labels 
include, for example, ligands such as biotin or thyroxine, as well as enzymes such as hydrolases 
or peroxidases, or a chemiluminescer such as luciferin, or fluorescent compounds like fluorescein 
and its derivatives. Altematively, the probes can be made inherentiy fluorescent as described in 
hitemationaj AppUcation No. WO 93/16094. 

Various degrees of stringency of hybridization can be employed. The more stringent the 
conditions, the greater tiie complementarity that is reqiiired for duplex form^^^ Stringency can . 
be controlled by temperature, probe concentration, probe length, ionic strength, time, and the 
like. Preferably, hybridization is conducted under moderate to high stringency conditions by 
techniques Well know in the art, as described, for example in Keller, G.H., M.M. Manak (1987) 
DNA Probes, Stockton Press, New York, NY., pp. 169-170, hereby incorporated by reference. 

As used herein, moderate to high stringency conditions for hybridization are conditions 
which achieve the same, or about the same, degree of specificity of hybridization as the 
conditions employed by the current inventors. An example of high stringency conditions are 
hybridizing at 68^ C in 5X SSC/5X Denhardt's solution/0.1% SDS, and washing in 0,2X 
SSC/0.1% SDS at room temperature. An example of conditions of moderate stringency are 
hybridizing at 68?C in 5X SSC/5X Denhardf s solution/0.1% SDS and washing at 42^C in 3X 
SSC. The parameters of temperature and salt concentration can be varied to achieve, the desired 
level of sequence identity between probe and target nucleic acid. See, e.g., Sambrook et al. 
(1989) vide infra or Ausubel et al. (1995) Current Protocols in Molecular Biology, John Wiley 
& Sons, NY, NY, for fbrfher guidance on hybridization conditions. 

Specifically, hybridization of immobilized DNA in Southern blots with ^P-labeled gene 
specific probes was performed by standard methods (Maniatis et aL) In general, hybridization 
and subsequent washes were carried out under moderate to high stringency conditions that 
allowed for detection of target sequences with homology to the exemplified GALATl sequences. 
For double-stranded DNA gene probes, hybridization can be carried out overnight at 20-25** C 
below the melting temperature (Tm) of Ihe DNA hybrid m 6X SSPE SX Denhardfs solution, 
0.1% SDS, 0.1 mg/ml denatured DNA. The melting temperature is described by tiie following 
formula (Beltz, G.A., Jacobe, T.H., Rickbush, P.T., Chorbas, and F.C Kafatos [1983] Methods 
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ofErtzymology, R.Wu, L, Giossman and KMoldave [eds] Academic Press, New York 100:266- 
285). 

Tin=8L5** C + 16.6 I^g[Na+]-H).41(+GH<;)-0.61(%fannfflnide)-600/lengfli of diqilex in 
basepaiis'. 

Washes are typically earned out as follows: twice at room tempo^ture for 1 5 xnimites in 
IX SSPE, 0.1% SDS Qow stringency wash), and once at C for 15 minutes in 0.2X SSPE, 

0.1% SDS (moderate stringency wash). 

• For oligonucleotide probes, hybridization was earned out overnight at 10-20® C below the 
melting temperature (Tm) of the hybrid 6X SSPE, 5X Denhardt's solution, 0.1% SDS, 0.1 mg/ml 
denatured DNA. Tm for oligonucleotide probes was determined by the following formula: 
TMCC)=2(numberT/A base pairs +4(number QIC base pairs) [Suggs, S.V., T. Miyake, E.H., 
Kawashime, MJ. Johnson, K. Itakura, and R.B. WaUace (1981) ICB-UCLA Symp..Dev. Biol 
Using Purified Genes, D.I). Brown (ed:). Academic Press, New York, 23:683-693]. 

Washes were typically carried out as follows: twice at room temperature for 1 S minutes 
IX SSPE, 0. 1 % SDS (low stringency wash), and once at the hybridization temperature for 1 5 
minutes in IX SSPE, 0.1% SDS (moderate stringency wash). . . 

In general, salt and/or temperature can be altered to change stringency. With a labeled 
DNA firagment >70 or so bases in length, the following conditions can be used: Low; 1 or 2X 
SSPE, room temperature; Low, 1 or 2X SSPE, 42" C; Moderate, 0.2X or IX SSPE, 65**, C; and 
HigJh,0.1X SSPE, 65*^0- 

Duplex formation and stabilify depend on substantial complementarity between the two 
strands of a hybrid, and, a3 noted above, a certain degree of mismatch can be tolerated. 
Therefore, the probe sequences of tihe subject invention include mutations (both 3ingle and 
multiple), deletions, insertions of the.described sequences, and combinations thereof, wherein 
said mutations, insertions and deletions permit formation of stable hybrids with the target . 
polynudeotide of interest" Mutations, msertions, and deletions can be produced in a given 
polynucleotide sequeiice in many ways, and those methods are known to an ordinarily sl^Ued 
artisan. Other metiiods may become known in the future. 

Thus, mutational, insertional, and deletional variants of the disclosed nucleotide 
sequences cm be readily prepared by methods which are well known to those skilled in the art. 
These variants can be used in the same manner as the exemplified primer sequences so long as 
the variants have substantial sequ^ce homology with the origiiaal sequeiice. As used herein, 
substantial sequence homology refers to homolpgy which is sufficient to enable the variant 
polynucleotide to function in tiie same capacity as the polynucleotide firom which the probe was 
derived. Preferably, this homology is greater than 80%, more preferably, this homology is 
greater than 85%, even more preferably this homology is greater than 90%, and most preferably, 
this homology is greater than 95%. The degree of homology or idmtity needed for the variant to 
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function in its intended capacity depends upon the intended use of the sequence. It is well within 
the skill of a person trained in this art to make mutational, insertional, and deletional mutations 
which are equivalent in function or are designed to improve the function of the se^ence or 
otherwise provide a methodological advantage. Methods for confirming promoter activity an 
xylan- or xylose responsiveness are known in the art 

Polymerase Chain Reaction (PGR) is a repetitive, enzytnatic, primed synthesis of a 
nucleic acid sequence. This procedure is well known and commonly used by those skilled in this 
art [see Mullis, U.S. Patent Nos. 4,683,195, 4,683,202, and 4,800,159; Saiki et aL (1985) Science 
230:1350-1354]. PGR is based on the enzymatic amplification of a DNA firagment of interest 
that is flanked by two oligonucleotide primers that hybridize to opposite strands of the target 
sequence. The primers are oriented with the 3* ends pointing towards each other. Repeated 
cycles of heat denaturation of the template, annealing of the primers to their complementary 
sequences, and extension of the annealed primers, with a DNA polymerase result in the 
amplification of the segment defined by the 5' ends of the PGR primers. Since the extension 
product of each primer can serve as a template for the other primer, each cycle essentially • 
doubles the amount of DNA template produced in the previous cycle. This results in the 
exponential accumulation of the specific target fi:agment, up to several million-fold in a few 
hours. By using a thermostable DNA polymerase such as the Taq polyn^erase, which is isolated . 
fix>m the thermophilic bacterium Thermus aquaticus, the amplification process can be completely 
automated. Other enz^es wMch can be used are known to those sldUed in the art^ 

It is well known in the art that the polynucleotide sequences of the present invention can 
• be truncated and/or mutated such that certainof the resulting fragments and/or mutants of the . 
original full-length sequence can retain the desifedcharacteristics of the full-length sequence, A 
wide variety of restrictioii ©nzymes which are suitable for generating fi:agments from larger 
xlucleic acid molecules are well known. In addition, it is well known that BaB\ exonuclease can 
be conveniently used for time-controlled limited digestion of DNA, See, for example, Maniatis 
(1982) Molecular Cloning: A Laboratory Manual^ Cold Spring Harbor Laboratory, New York, 
.pages 135-139, incorporated herein by reference. See also Wei et al. (1983 J. Biol Chem. 
. 258:13006-13512. ByuseofjSaiSl exonuclease (conmionly referred to as "erase-^-base" 
procedures) j the ordinarily skilled artisan can remove nucleotides from either or both ends of the 
subject nucleic acids to generate a wide spectrum of fragments which are functionally equivalent 
to ttie subject nucleotide sequences. One of ordinary skill in the art can, in this manner, generate 
hundreds of fragments of controlled, varying lengths from locations all along the original xynA 
molecule. The ordinarily skilled artisan can routinely test or screen the generated fragments for 
their characteristics and determine the utility of the fragments as taught herein. It is also well 
known that the mutant sequences of the full length sequence, orfragments th^eoi^ can be easily 
produced with site directed mutagenesis. See, for example, Larionov, O.A. and Nikiforov, V.Q. . 
(1982) Genetika 18(3):349-59; Shortle, D, DiMaio, D., and Nathans, D. (1981) u4nntt. Rev. 
Genet 15:265-94; both incorporated herein by^reference. The skilled artisan can routinely 
produce deletion-, insertion-, or substitution-type mutations and identify those resulting mutants 
which' contain the desired characteristics of the ftill length wild-type sequence, or fragments 
thereof, i.e., those which retain promoter activity. 
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DNA sequences having at least 90, or at least 95% identity to the DNA sequences as 
given herein are most preferable. In addition, various functional equivalents are included in the 
definition of a GALATl coding sequences and traixscription regulatory, sequence. Following the 
teachings herein and \ising knowledge and techniques well known in the art, the skilled worker 
will be able to make a large number of operative embodiments having equivalent DNA 
sequences to those listed herein without the expense of undue experimentation. 

As used herein percent sequence identity of two nucleic acids is determined using the 
algorithm of Karlin tod Altschul (1990) Proc. Natl. Acad. Sci, USA 87:2264-2268, modified as 
inKarlinandAltschul(1993) Proc. Nati; Acad. Sci. USA 90:5873-5877, Such an algorithm is 
incorporated into tibieNBLAST arid XBLAST programs ofAltschul et al. (1990) J. Mol Biol. 
215:402-410. BLAST nucleotide searches are performed with the NBLAST program, score = 
JOO, wordlengfh = 12, to obtain nucleotide sequences with the desired percent sequence identity. 
To obtain gapped alignme9t5 for comparison purposes. Gapped BLAST is used as desaibed in 
Altschul et al. (1997) Nucl. Adds. Res. 25:3389-3402. When utUizing BLAST and G^ped 
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BLAST programs, the defiiult parameters of the respective programs (NBLAST and XBLAST) 
are used. See, for example, the National Center for Biotechnology hifonnation website on the 
internet. 

Monoclonal or polyclonal antibodies, preferably monoclonal, specifically reacting with a 
protein of interest can be made by methods well known in the art See, e.g., Harlow and Lane 
(1988) Antibodies: A Laboratory Manual^ Cold Spring Harbor Laboratories; Coding (1996) 
Monoclonal Antibodies: Principles and Practice^ 3rd ed., Acadraiic Press, San Diego, CA, and 
Ausubel et al. (1993) Current Protocols in Molecular Biology, Wiley Intersdence/Greene 
PnbUshing, New York, NY. * . . 

Standard technique for cloning, DNA isolation, amplification and purification, for 
enzymatic reactions involving DNA ligasQ, DNA polymerase, restriction endonucleases and the 
like, and various separation techniques are those known and commonly employed by those 
skilled in the art. A number of standard techniques are described in Sambrook et al. (1989) 
Molecular Cloning, Second Edition, Cold Spring Harbor Laboratory, Plainview,.New York; 
Maniatis et al. (1982) Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New 
York; Wu (ed.) (1993) Meth. EnzymoL 218, Part I; Wu (ed.) (1979) MetL Enzymol 68; Wu et al. 
(eds.) (1983) Meth. Enzymol 100 and 101; Grossman and Moldave (eds.) Meth. EnzymoL 65; • 
Miller (ed.) (1972) Experiments in Molecular Genetics, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York; Old and Primrose (198 1) Principles of Gene Manipulation, , 
. University of California Press, Berkley; Schleif and Wensink (1982) Pracrica/ Methods in 
Molecular Biology; Glover (ed.) (1985) DNA Cloning Vol. I and H, IRL Press, Oxford, UK; 
Hames and Higgins (eds!) (1 985) Nucleic Acid Hybridization, IRL Press, Oxford, UK; Setlow 
and HoUaender (1979) Genetic Erigineering: Principles and Methods^ Vols. 1-4, PlCTwm Press, 
New York; and Ansubel et al. (1992) Current Protocols in Molecular Biology, Greene/Wiley, 
New York, NY. Abbreviations and nomenclature, where employed, are deemed standard in the 
field and commonly used in professional journals such as those cited herein. 

All references cited in the present application are incorporated by reference herein to the 
extent that there is no inconsistency wi& the present disclosure. 
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APPENDIX I 



Se<iuence #1 • . • 
Gene name: At3g61130 

GeneBank .accession # for reference:. NIC115977.1' 
. • . • ". GI:184118S5 

Nucleotide sequence of Sequence. #1: . ■ * . 

Fosi^ions 314-2347.. ... 

A1?TCCGA GGATG6CGCT AAAGCGAGGG CTATCTGGAG TTAACCGGAT 

TAGAGGAAGT GGTGGTGGAT CTCGATCTGT GCTTGTGC*rT CTtATATTTT TCTGTGTTTT 
TGCACCTCTT? TGCTTCTTTG TTGGCCGAGG AGTGTATRTC . GATTCCTCAA ATGATTATq?C 
AATTGTTTCT GTGAAGCAGA ATCTTGACTG GAGAGAACGT TTAGCAATGC AATCTGTTAG 
ATCTCTTTTC; TCGAAAGAGA TACTAGATGT TATAGCAACC AGCACAGCTG ATTTGGGTCC 
tcttagcctt' GATTCTTTTA AGAAAAACAA TTTGTCTGCA TCATGGCGGG GAACCGGAGT 
AGACCCCTXiC TTTAGACATT CTGAGAATCC AGCAACTCCT GATGTCAAAT ctaataacct 
GAATGAAAAA CGTGACAGCA TTTCAAAAGA TAGTATCCAT CAGAAAGTTG agacacctac 

aaagattcac agaaggcaac taagagagaa aaggcgtgag atgcgggcaa atgagttagt 

TCAGCACAAT GATGACACGA TTTTGAAACT CGAAAATGCT GCCATTGAAC GCTCTAAGTC 
TGTTGATTC.T GGAGTCCTTG dTAAATACAG TATTTGGAGA AGAGAAAATG AGAATGACAA 
CTCTGATTCA AATATACGCT TGATGCGGGA TCAAGTAATA ATGGCTAGAG TCTATAGTGG 
GATTGGAAAA TTGIU^AAACA • AGAACGATTT? GTTACAAGAA .CTCCAGGCCC GACTTAAGGA 
CAGCCAACGG GTTTTGGGGG AAGCAACATC TGATGCTGAT CTTCCTCGGA GTGCGCATGA 
GAAACTCAGA GCCATGGGTC AAGTCTTGGC TAAAdCTAAG ATGCAGTTAT ATGACTGCAA 
GCTGGTTACT GGAAAGCTGA GAGCAATGCT TCAGACTGCC GACGAACAAG TGAGGAGCTT 
aaagaagcag- AGTACTTTTC TGGCTCAGTT AGCAGCAAAA accattccaa atcctatcca 
TTGCCTATCA ATGCGCrreA CTATCGATTA CTATCTTCTG TCTCCGGAGA AAAGAAAATT 
CCCTCGGAGT. GAAAAdCTAG AAAACCCTAA TCTTPATCAT TATGCCCTCT TTT.CCGACAA 
TGTATTAGCT GCATCAGTAG TTGTTAACTC . AACCATCATG AATGCCAAGG ATCCTTCTAA 
GCATGTTTTT CACCTTGTCA CGGATAAACT CAATTTCGGA GCAATGAACA TGTGGTTCCT . 
CCTAAACCCA CXTCGGAAAGG CAACCATACA' TGTGGAAAAC GTCGATGAGT TTAAGTGGCT 
CAATTCATCT TACTCTCCTG TCCTTCGTCA GCTTGAATCT GCAGCAATGA GAGAGTACTA 
TTTTAAAGCA GACCATCCAA CTTCAGGCTC. TTCGAATCTA AAATACAGAA ACCCAAAGTA 
TCTATCCATG TT6AATCACT ' TGAGATTCTA CCTCCCTGAG GTTTATCCCA AGCTGAACAA 
AATCCTCTTC CTGGACGATG ACATCATTGT TCAGAftAGAC TTGACTCCAC TCTGGQAAGT 
TAACCTGAAC GGCAAAGTCA ACGGTGCAGT CGAAACCTGT GGGGAAAGTT TCCACAGATO 
CGACAAGTAT CTCAACTTTT CGAATCCTCA CATTGCGAGG AACTTCAATC CAAATGCTT6 
TGGATGGGCT TATGGAATG A • ACATGTTCG A CCTAAAGGAA TGGAAGAAGA GAGACATCAC 
TGGTATATAC CACAAGTGGC AAAACATGAA TGAGAACAGG ACACTATGGA AGCTAGGGAC^ 
ATTGCCACCA GGATPAATAA CATtCTACGG ATTAACACAT CCCTTAAACA AGGCGTGGCA 
TGTGCTGGGA' CTTGGATATA ACCCGAGTAT CGACAAGAAG GACATTGAGA ATGCAGCAGT 
GGTTCACTAT AACG66AACA TGAAACCATQ GTTGGAGTTG GCAATGTCCA- AATATCGGCC 
GTATTGGACC AAGTACATCA AGTTTQATCA . CCCATATCTT CGTCGTTGCA ACCTTCATGA. 
ATAAAAT 



Amino Acid S&[S!denc& of Sequence #1: 
GeicieBank iD# NP_191672 
Positions 1-673. 

MALKRGLSGVNRIRGSGGGSRSVLVLLIFFCVFAPLCFFVGRGV 

YIDSglSroYSIVSVKQNLDWRERLAMQSVRSLFSKEILDVIATSTADLGPLSIiDSFKKN 

NliSASWRGTGVDPSFRHSENPATPDVKSNNLNEIOmSISKbiSIHQKVOT 

REKRREMRANELVQHNDDTILKLBNAAIERSKSVDSAVIXSKy 

RLJlRDQVIMAKVYSGIAKLKNKNDLLQELQARIiKDSQRVM 

AMGQVIIAKAKMQMPCKLVTGKLRA^ILQTADEQVRSLKKQOT 

IiSMRLTIDYYLLSPBKRKFPRSENIiENPNIiYHYALFSDIIVLAASX^^ 

KHVFHLVTDKLNFGAMNMW 

EYYFKADHPTSGSSKnCjKYRNPKYLSMLiraijRFyLPEyYPEa^ 
PLWEVlSnJNGKVNGAVETCGES^TJRFDKilJNFSOT 
WK.KRDITGIYHKWQmiNEip'njWKLGTI.PPGIiITFYGLa^ 
KIpXE^rAAVVHYNGSIMKPvLBIlAMSKYRPyWTI^ 



Sequence #2 ... 
Gene name: At2g38650 

GeneBank accession # £or reference: AY050982 

GIil5293096 

Nucleotide- sequence of 5equen<:e #2: ' 
Positions 198-2057, 

* . . *' ' ' . ' ' ' 

Aa?G AAA GGC GGA GGC GGT GGT GGA GGA GGT GGT GGC GGA GGA AAA CGC CGG TGG 
• AAA GTT CTG GTC ATT GGA GTT TTG GTT. CTT GTT ATT CTT TCT ATG CTT GTT CCT 
CTT GOT TTC TTA CTC GGT CTT GAG AAT GGC TTT CAC TCT CCT GGA* TTT GTC ACT 
GOT CAA CCG GCT TCT TCA TTT GAG AGC TTT ACC AGA ATC-AAT GCT ACT AAG CAT 
ACA CAG AGA GAT GTA TCC GAA CGG GTC GAT GAG GTT CTT CAA AAA ATC AAT CCA 
GTT CTT -CCC AAG AAA AGC GAC ATA AAC GTG. GGT TCC AGA GAT GTG AAT GCA ACA 
. AGC GGC ACT, GAT TCT* AAA AAA AGA GGA TTA CCA GTG TCC CCA ACT GTT GTT GCC 
.AAT qCA AGC CCT GCA AAT AAA ACA AAA TCG GAA GCC TCA TAT ACA GGT GTT CAG 
AGG AAA -AtA GTA AQT GGT GAT GAA ACT TGG AGA ACT TGT GAA GTG AAA TAT GGG 
AGC TAC TGC CTC TGG AGG GAG GAA AXT AAG. GAA CCA ATG AAA GAT GCC AAG GTG 
AAG CAA. ATG AAG GAC CAG CTG TTT GTG GGT AGA GGA TAC TAT CCC AGT ATT GCT 
. AAA. ATG CCT TCT CAA AGG' AAG TTG ACT* CGG GAT- ATG AAA CAG AAT ATC CAA GAg' 
TTT GAG CGT ATT 'CTT AGT GAA AGT TCT CAA GAT GCT GAC CTT CCA CCA CAG GTT 
GAT AAA AAG TTG CAG AAG ATG GAA GCT GTA ATT GCA AAG GCA -AAG TCT TTT CCA 
GTC GAC ..TGT AAC. AAT GTT GAC AAG- AAA TTG AGA CAG ATC CTT GAT TTG ACT GAG 
GAT dAA GCT AGT TTC CAC ATG AAA CAG AGT GTG TTG CTG TAC CAG CTT GCA GTA 
CAG ACA ATG CCT AAG . AGT CTT'. CAT TGC TTG TCA ATG CGA CTA ACT GTG GAA CAT 
TTC. AAG TCA GAT TCA CTT GAG GAT CCC ATT. AGT GAG AAA TTT TCA GAT CCC TCA 
TTA CTT CAC TTT GTT. ATC ATC TCC GAT AAT ATA CTA, GCA TCG TCC GTT GTG ATC' 
AAC TCA ACG GTT GTA CAT GCA AGG GAC AGT AAA AAC TTT GTT TTC CAT GTA CTG 
ACA . GAC GAG CAG AAT TAC TTT GCA ATG AAA CAA TGG TTT ATT AGG AAT CCT TGC' 
AAA CAA TCA ACT GTT CAA GTA TTQ AAC ATT GAA AAA' CTC GAG CTG GAC GAT TCT 
GAT ATG AAA CTG TCT TTG TCT- GCG GAG. TTC CGT GTT TCC .TTC CCC AGT GGT GAC 
CTT TTG GCG TCT CAA CAG AAT AGA ACA CAC TAC TTA TCC CTT TTC TCT CAA TCT 
• CAC TAT CTT GTT CCC AAA TTA TTT GAC AAA TTG GAG AAG GTT GTG ATT CTG GAT 
GAT .GAC GTT GTA GTC CAG CGA GAC TTA TCT CCC CTT TGG GAC CTT GAT ATG GAA 
GGG. AAA GTG AAT GGC GCT .GTT AAG TCG TGC ACT . GTG AGA TTG GGT CAG CTA AGG 
AGT CTC AAG AGA 'GGA AAT TTT GAT ACC AAT GCT TGT CTC TGG ATG TCT GGT TTG 
AAT. GTG GTT GAT CTT GCT AGA TGG AGG GCA TTG GGT GTT TCA GAA* ACC TAT CAA 
AAA TAT TAT AAA GAG ATG AGT AGT GGA GAT GAG TCG AGC GAA GCA ATT GCA TUG 
CAG GCA AGC TTG CTQ ACA TTT CAA GAC CAA GTA TAT GCT CTT GAC GAC AAA TGG 
GCT CTA.TCA-GGG CTT GGT TAT GAC TAC TAC ATC AAT GCA CAA GCC ATA AAA AAC ' 
GCA GCC ATA TTG CAC TAT AAC GGG AAC ATG AAG CCG TGG CTT GAQ CTG GGA ATC ' 
CCA AAT TAC AAA. AAC TAT TGG. AGA AGG CAT CTG AGT CGG GAA GAT CGG TTC TTG 
AGT GAC. TGT AAC GTG AAT CCT TGA 



AmxTio Acid' Sequence o£ Seqcuence #2: < 

GeneBank ID# NP^65893 

Positions 1-619. • - * . 

. MKGGGGGGGGGGGGKRRWKVLVniGVLVIiVILSMLWIAFLLG^ * 

NGFHSPGFVWQPASSFESFTRINATKHTQRDVSERVDEVLQKINPVLPKKSDIWGS 
RPVWATSGTDSKKRGLt*VSPTVVANPSPANKTKSEASYTGVQR^ 
YGSYCLWREENKEPMKnAKVKQMKDQLFVARAYYPSIAK^ 
RILSESSQDADLPPQVDKKIiQKMEAVIAKAKSPPVipCNNVDKKLRQILDL^^ 
MKQSVFLYQIAVQTMPKSLHGLSMRliTVEHPKSDSIiEDPISERFSDP.SLL 
' ILASSWINSIVVHARDSKNFVFHVLTDEQNYFAM^ 

IiDDSbMKXiSLSAEFRVSFPSGDtiLASQQNRTHYLSLFSQSHYriLPKLFDKLEKVVILD 

DDVWQRDIiSPLWDLDMEGKWGAVKSCTVRLGdLRSLKRGNFDTNACLV^ 

XjmWRALGVSETYQKYYKEMSSGDESSEAIALQASLLTFQDQVYAIiDDKWAIiSG 



Sequence #3 .. *' . , 
Gene name: At5g47780 

GeneBank .accession # for reference: Ay056202 

GI: 15810326 

Nucleotide sequence of Sequence #3 : • ' 
Positions 66-1914 • f. 

ATG ATG GTG -AAG CTT CGC AAT CTT 6TT CTT TTC TTC ATG CTC CTC ACC GTC GTT 
. OCT CAT ATC CTT CTC TAG ACC GAT CCC GCT GCC TCC TTC AAG ACC CCC TTT TCT 
AAA CGC GAT TTC CTC GAG GAC GTA ACC GCC TTG ACT TTC AAT TCC GAT GAG AAT 
CGT TTG AAT CTT CTT CCT CGG-GAA TCT CCC GCT GTG CTC AGA GGA GGA CTC GTC 
GGT GCT GTC TAT TCC GAT AAG AAT TCA CG6 CGG CTA GAC CAA 'TTG TCT OCT CGA 
GTT CTT TCC GCC ACC GAC GAT GAT ACT CAC TCA CAT ACT GAC ATT TCC ATG AAA 
CAA. GTC ACT CAT GAT GCA GCC TCA GAC TC6 CAT ATT AAT AGG GAA AAT ATG CAT 
GTT CAA TTG ACC. GAA CAA ACC T^T GAA AAA GTT GAT GAG CAA CCA GAG CCT AAT 
TTT GGA GCT AAG AAA . GAT ACT GGA AAC GTG TTG ATG CCT GAT GCT CAA GTG 
AGG CAT CTT AAA GAT CAG CTT ATT AGG GCA AAG GTT ^AT CTT TCC CTT CCA TCT 
GCA AAG GCC AAT GCT CAT TTT GTG AGA GAG CTT CGA CTC CGT ATt AAA GAA GTT 
CAA CGG GCA CTT. GCA GAT GCC TCC AAG* GAST TCG GAT CTG CCA AAG ACT GCT ATA 
GAA. AAG CTA AAA GCA ATG GAG CAA AGA. CTG GCC AAA GGC AAG CAG ATC CAA GAT' 
GAC TGT TCT ACA GTG GTC AAG AAG CTA CGT GCT - ATG CTC CAC TCQ GCA GAT GAG 
CAG CTA CGG GTC CAT AAG AAG CAA ACC ATG TTT TTG ACT CAA TTG ACT GCT AAG 
ACC ATT CCT AAA GGA CTT CAC TGC CTT CCT CTG CGC CTC ACT ACA GAC TAT TAT 
GCT TTA AAT TCA TCT GAA CAA CAA TTT CCA AAT CAG GAG AAA CTA GAA GAT ACT' 
CAG. CTG TAT .GAC TAT GCC CTT TTC TCT GAT AAT GTT. TTG GCT ACG TCA GTT GTT 
GTT AAC TCT- ACC ATA ACC AAT GCA AAG CAT CCC TTA AACCAT GTC TTC CAC ATC 
GTC ACA. GAC AGA CTC AAT TAT GCG GCA ATG. AGG ATG TGG TTC CTG GAC AAT CCA 
CCT GGC AAA GCC ACC ATC CAG GTT CAG AAT GTT GAA GAA TTT ACA TGG CTG AAT 
TCA AG.C- TAC AGT CCC GTT CTC AAA CAG CTT AGT.TCT AGA TCG ATG ATA GAT TAT ' 
TAG TTC AGA GCC CAC CAT ACA AAT TCA GAC ACC AAC TTG AAG TTC CGG AAT CCA 
AAA TAC TTA TCG ATC CTT AAT CAT CTT CGT TTT TAC TTG CCT GAG ATC TTT CCC 
AAG CTC AGC AAA GTG CTC TTC TTG GAT GAT .GAT ATA GTT GTG CAjS AAG GAQ CTT 
TCT GGT CTT TGG TCA GTT GAT CTG AAA. GGT AAT GTT AAC GGT GCT GTA GAG ACG 
TGT. GGG GAA AGC TTT CAT CGC TTT GAC CGT TAT CTG AAG TTC TCA- AAT CCA CTC 
ATT TCC AAG AAC TTT GAC CCT CGA GCT TGT GGT TGG GCG TAT GGT ATG AAT GTC 
TTT GAT CTG- GAT GAA TGG AAG AGG CAA AAC ATC ACA GAA GTT TAT CAT CGA TGG 
CAG GAT CTG AAT .GAA GAC CGA GAA TTG TGG AAG CTA GGG ACG TTG CCG CCT GGT 
CTA ATC ACA TTT TGQ. AGA CGA ACA TAT CCG CTA - GAC CGG AAA TGG CAC ATA CTA 
GGG CTT GGA TAC AAC CGG AGT GTG AAC CAA AGG GAT ATT GAG AGG GCA GCC GTG 
ATA CAC TAT AAT GGC AAC' CTC AAA CCA T6G CTA GAG ATT GGG ATT CCA AGA TAC 
AGA GGC TTC TGG TCA AAG CAT GTA- GAC TAT GAG CAC GTT TAT CTC AGA GAA TGC 
AAC ATC AAT! CCT TAG 



Amino Acid Sequence o.f Sequence #3 : 
Genebank IP# AAL07051 
Positions 1-:615- • • 

. MMVKriMOiVLFFMLLTVVAHILLYTDPAASFKTPFSKRDFLEDV 
TAXiTFNSDENRLNLLPRBSPAVLRGdLVGAVySDKNSRRMJ&IiSARVLS 
TDI SIKQVTHDAASDSHINRE?5MHVQLTQQTSEKVDEQPEPNAFGAKKOT 
QVRHriKbQLIRAK\^LSLPSAKANAiIFVRELRLRIKEyQRAI^ 
LKMlEQTIiAKGkQIQDDCSTVVKKriRAMLHSADEQLE^\^^ 
HCLPIiRLTTDYYAlJJSSEQQFPWQEKLEDTQIjYHYAbFS^ 
PXiKHWHIVTDRIiNYAAMRMWPLpNPPGKATIQVQNVEEPT^^ 
MIDYYFRXHHTKrsr)™:iKFRNPKY^ 

SGLWSVDLKGNW6AViE;t'CGESFHRFDRYIiNFSNPrjISKNFpPRAC<^ 
• EWKRQNITEVYHRWQDLNQDRELWKIiGTLPPGLITFWRRTYPIiDRK^ 
NQRDIERAAVIHTOGNLKPWLEIGiPRYRGFWSKHVDYEHV^^ 



Sequence #4 

Gene name: AtlgOG780 

Genepahk accession #. for. reference: AY091452 

• GX: 20259310 

Nucleotide sec[uence of Sequence #4: 

Positions 1-1770. *, . 

ATG AAA CAA ATT CGT CGA TGG CAG AGG ATT TTG ATC CTC GCT CTG CTA TCG MH^ 
TCA GTA TTC' GCT CCG CTT ATT TTC GTA TCG AAT CGG CTT AAG AGC ATC ACT CCC 
GTT G6T CGT AGA GAA TTT ATT GAA GAG TTA TCC AAA ATT AGA TTC ACG AQPl' AOT 
GAG CTT AGA CTT AGC GCT ATT GAA CAT GAG *GAT GGA GAA .GGC TTG AAG. GGG .CCA 
AGG CTC ATT CTC TTC AAG GAT GGG GAG* TTT' AAT TCG TCT GCT GAA AQT GAT GOT 
GGT AAT ACT TAC AAA AAC AGG GAA GAA CAA GTG ATT GTT TCA CAG AAG ATG ACA 
. GTT AGC TCT GAT GAA AAG GGT. CAA ATT CTA CCA ACA GTC AAC CAA CTT GCT ART 
AAA ACQ GAT TTC AAG CCC CCT TTA TCT AAG GGT. GAA* AAG AAC ACA AGG GTT CfiS 
CCC GAC AGA GCA ACA GAT GTG' AAA ACG AAG GAG ATC AGA GAC. AAA ATT ATT CfflS 
GCT AAA GCC TAC CTG AAT TTC GCT GCA* CCT GGA AGT. AAC TCT CAA GTT GTG ABG 
GAG TTG AGA GGT CGG CTG AAA GAG CTG GAA CGG TCT GTT GGT GAT GCA ACA, AfiS 
GAC AAG GAC TTA TCA A^ GGC GCT CTC CGC AGG GTG AAG CCC ATG GAA AAT G3S 
TTA TAT .AAG GCT AGT CG* GTC TTT AAC AAT TGC .CCT GCC ATC GC^P ACC AAA CafC* 
CGT GCC ATG AAT TAT AAC ACA GAA GAA CAA GTT CAG GCG .CAG * AAA AAT CAA GCA 
GCG TAT CTA. ATG CAG CTT GCA GCA AGG ACC ACC CCA AAA GGG CTT CAC TGT CEC 
TCA. ATG CGG CTG ACA TQA GAA TAC TTT TCA CTG GAT CCT GAA AAA AGG CAG ATG^ 
CCT AAC CAG CAA AAT. TAT TTT GAC GCT AAT TTC AAT .CAT TAT-. GTT GTC; TTG TCT 
GAC AAT GTT TTG GCT TCT TCA GTC GTT GTT AAC TCT ACG ATA TCT TCA TCA A2I& 
GAG CCA GAA AGA ATA GTC TTC CAT GTC GTG ACT GAT TCA CTT AAT TAC CCA GCA 
ATC- TCA ATG TGG TTT CTG CTA '.AAC ATT CAA AGT.' AAA GCT ACT* ATC CAA ATC Clft 
AAC. ATT GAT GAT ATG GAT GTC CTG CCT. AGA GAT TAT GAT CAA TTA CTG ATG AffiS 
CAA* AAC. TCT AAT GAC* CCA AGA TTC ATT TCT ACA CTC AAT CAC -GCA CGC TTC TSa? 
CTQ CCG GAT ATA TTC CCG GGT TTG AAC AAG ATG GTA CTC TTG GAC CAT GAT GT&. 
GTT GTT CAA AGA GAT TTA AGT AiSA CTG T?GG AGC ATT GAT ATG AAA GGA AAG GT«C 
GTT GGA GCT GTA GAG ACT TGT CTT GAA GGT GAA TCT TCA TTT CGA TCA ATG AGC 
ACA TTT ATT AAT TTC TCA GAC ACA TGG GTC GCT GGG AAA TTT AGT CCT AGA GCF 
TCC ACA TGG GCT TTC GGG ATG AAT CTA ATT GAT CTC GAA . GAA .TGG AGA ATA COS 
AAG. TTG ACT TCT ACA TAC ATA AAA TAC TTC AAC CTG GGA ACA AAG. AGA, CCA TTC. 
TGG AAA GCT- GGG .AGC TTA CCA ATA" GGT TGG TTG ACT' TTC TAT AGG" CAA ACA^ rfiia* 
GCA TTG. GAC AAG AGA TGG CAT GTG ATG GGG TTA GGT CGC GAA TCTA GGA GTC. AA& 
GCG GTT . GAC ATC GAA CAA GCG GCA GTT ATA QAC TAC GAT GGG GTC ATG AAG COS 
TGG TTG GAC ATT GGA AAA GAG AAT TAC AAA CGT TAC TGG AAC ATA CAC GTC CCIf 
TAC CAT CAC ACC TAC TTG CAA* CAG TGC AAT CTT CAA GCT T«3A 



Amino Acid Sequence of Sequence #4: 
Genebank ID# NP_5 63771 
Positions 1-589 • 

MKQIRRWQRILIIiALLSISVFAPLIFVSNRIiKSITPVGRREFIE 
ELSKIRFTTNDLRLSiijCEHEIXSEGLKGPRLILFiaDGEFNSSi^ 

^QAKAYLNFAPP6SNSQVVKELRGRLKELERSV6DATKDKDLSKGA]jIalVKPMEN^^ 

KASRVFNNCPAIATKLRAMiran'EEQVQAQKNQAAYI^ 

EYFSLDPEKRQMPNQQNYFDANFNHYVVFSDNVI*ASSV^^ 

TDSLir^PAISftnflJFLLNXQSKATIQILNipDMDVLPRDYPQIil^QNS 

ARFYLPDIFPGbNKMVLLDHDVWQRDLSRLWSIDMKGKWGAVETCLEGESSFRSM 

TFINFSDTWVAGKFSPRACTWAFGMNIilDLEEWRIRKLTSTYEKYFl^GTKRPL^ 

SLPIGWTiTFYRQTIAIJDKRiraVMGIXSRESGVKAVblEQA^^ 

YKRYWNIHVPYHHTYLQQCNIiQA 



Sequence #5 . 

Gene name: AtiglSSSO ' . * 

' GeneBank accession # for reference: ^AY062444 

GI: 17064735 

Nucleotide sequence of Sequence #5: . ' . • 

Positions 281-1892. / ' 

• -t . • 

ATG AGG • CGG TGG CCG 6TG GAT CAC CGG CGG CGA GGT AGA AGG . AGA TTG ICG AGT 
TGG ATA TGG .TTT CTC CTT GGT TCT TTC TCT GTC GCT GGT TTA GTT CTC TlIC ATC 
GTT CAG CAT TAT CAC .CAT CAA CAA GAT CCA TCC CAG CTT TTA CTT GAG AfiSA GAC 
ACG AGA ACC GAA ATG GTA TCT CCT GCC CAT TTA AAC TTC ACG GAA GAG GEC ACA 
AGT GCT TCC TCC TTQ TCT AGG CAG TTA GCA GAG CAA ATG ACA CTT GCC ARA GCT 
TAT GTG TTT ATA GCT AAA GAG CAT AAT AAT CTT CAT TTA GCT TGG GAA T!EG AGT 
TCT AAG ATC AGA AGT. TGT CAG CTT TTG CTT rteC AAA GCA GCT ATG* AGA O&A CAA 
CCT ATT TCG TTT GAT GAG GCT AAA ' CCG ATT ATT ACT GGT CTA TCA GCT CSV ATC • 
TAG AAG GCT CAA GAT GCA CAT TAT GAT ATT. GCC ACC ACT ATG ATG ACC AaUG AAA 
TCT CAC ATC CAA* GCA CTT GAA GAG CGT GCA AAT GCA GCT ACT GTT CAG ACC ACA 
ATA TTT GGG GAA TTG GTT GCT GAG GCA TTA CCA AAG AGC" CTC CAC TGT' T3EG ACG " 
ATA AAG CTC ACA TCT GAT TGG GTA ACA GAG CCA TCT. CGC CAT GAA CTG OCA GAT 
GAG AAC AGA AAC TCA CCT AGA CTT GTC GAC AAC AAC CTC TAC CAC. TTC TGQ, ATC 
TTC» TCG GAC AAC GTO .ATT GCC ACC TCG GTT GTT GTT AAT' TCA ACT GTC TCG AAT 
GCT GAT CAT CCA AAG CAG CTT GTT TTC CAC ATA. GTG ACG AAT .CGA GTG M3C TAC 
AAA GCT ATG CAG GCQ TGQ TTT CTA AGT AAT QAG TTC AAG GGC TCA GCA ASEA GAG 
ATC AGG AGC GTA GAG GAG TTT TCT . TGG TTG AAT GCT TCA TAT TCT CCT GffT GTT ' 
AAG CAA CTG, PTG GAC ACA GAT GCA AGA GCT TAC TAT TTC GGG GAA CAG ACA ACT 
CAA GAT ACG ATT TCC GAG CCA AAA GTG AGG AAC CCA AAG TAC TTG TCA TEPt CTG 
AAC. CA*? CTC AGA. TTC TAC ATT CCG GAG* ATC TAT CCA CAG CTA GAG AAG A-m GTT 
TTC CTA GAC. GAT GAT. GTT. GTT GTT CAG AAA GAT TTG ACT CCA CTG TTC TOC TTG 
GAT CTG, CAT .GGA AAC GTC" AAT GGA GCT GTG GAA ACA TGT CTT GAA . GCC TTfc' CAC 
CGA- TAT TAC AAG TAT CTA AAT TTC .TCG AAC CCA CTC ATC AGC TCA AAG TTC GAC 
CGA CAA GCA TGT GGA TGG GCT TTT GGT ATG AAC GTT TTT GAT CTG ATC GCT TGG 
AGG AAT GCA AAC GTQ ACT GCT CGG TAC CAT TAC .TGG CAA GAT CAG AAC AGZt GAA 
CGA ACG CTT TGG AAA CTC GGG ACA. CTC CCT CCA GGT CTA CTA' TCT TTC TJWT GGT 
CTC ACA GAG CCA CTG GAC AGA AGA TGG CAT GTC TTG GGT TTA GGT TAC GM GTG 
AAC ATC GAT AAC CGT CTG ATC GAA ACA GCA GCT GTG ATT CAC TAT AA^' GST AAC 
ATG AAG. CCT TGG CTA" AAG CTG GCT ATT GGT AGG TAT AAA CCT TTC .TGG TIER. AAG ', 
TTT /I*TG AAC T^G AGfc CAT CCT TAT TTA CAA GAT TGT GTC ACA GCT TAA 



Amino Acid Sequence of S^quencQ #4: . 
Genebank XD« a*ak93644. 
Positions 1-537 • • 



tdRRWPVDHjmRGRRRLSSWIWFLliGSPSVAGLVLPIVQHYHHQQDPSQLLI^^ 
.TSASSFSRQI1A6QMTI1AKAYVFIAREHNNLHI1AWEI.3 
LIYKTVQDAHYDIATTMMOMKSHIQALEEHANAATVQTTIFGQIiVAEAI^ 
XjaJENRNSPRLVPNNIiYHFCIFSDNVIATSVVVNSTVS^ 

SAIEIRSVEBFSWLNASYSPVVKQLLDTDARAYYFGEQTSQDTISEPKVRNPKYLSLLNHLR^ 
LEKIVFLDDDVVA^QKDLTPLFSBDIiHGNWGAVETCLEAFHRYYKYllJNFSOT 
FDIiIAWRNANVTARYHYWQDQNRERTLWKIiGTLPPGI.i:iSFYQ^^ 
VIHYNGNMKPWLiOAIGRYKPFWLKFIiNSSHPYLQDC^ 



Sequence #6 . 
Gene name: At2g20810 

GeneBank accession # for reference: AY059085.1 61:16323393 
Nucleotide sequence of Sequence #6: ' • 
Positipn's 1-1609- 

ATG AGA AGO AGA GGA GGG GAT AGT TTC CGG AGA GCT GGA CGG AGG. AAG ATC TCG 
AAT GTG GTA TGG TGQ GTT CTC TCT GGT ATT. GCC CTC CTG CTC TTC TTT CTC ATT 

• CTC TCC AAA GCT GGT CAT ATT GAA CCT AGA CQC TCT ATT CCT AAG CGA CGT TAG 
CGT AAT GAC AAA TTT GTA GAG GGT ATG AAT ATG ACT GAG GAA ATG TTG AGT CCT 
ACT TCC GTT GCT CGT CAA GTT AAT GAT CAG ATT GCT CTT GCT AAA GCT TTT .GTT- 

. GTC ATT GCT AAA GAA AGT AAG AAT CTT CAG TTT GCT TGG GAC TTA AGT GCT CAG 
ATC CGT AAC TCT CAG TTG CTT TTA TCG AGT GCT GCT ACT AGG AGA AGT CCC TTG 
ACT GTC. TTG GAA TCT GAG TCT ACT ATT CGT GAC ATG GCT GTT TTG TTA TAT CAA 
GCT CAG CAG CTT CAC -TAT GAT AGT GCT ACT ATG ATT ATG AGG CTT AAG GCC TCG 

' ATT CAG GCT CTT .GAA GAA CAA ATG AGT TCC GTT AGC GAG AAG AGT TCC AAG TAT 
GGA CAG, ATT GCT GCT GAG GAA GTG CCT AAG AGT CTT TAG TGT CTT GGT GTT CGT 
CTC ACT. ACC GAA TGG TTT' CAG AAT .TTA GAC ..TTA CAG AGA ACT CTT AAG GAA AGG 
AGT CGT GTT GAT TqG AAA QTC ACG GAT AAC AGT CTC TAC CAT* TTC TQT GTG TT^P 
TCC GAT AAC ATT ATT GCT ACT TCT GTT GTG GTT AAT " TCT ACT GCT CTC AAT TCC 
AAG GCC CCT GAG, AAA GTT GTG TTT CAT CTT GTG ACT AAT GAG ATC AAC TAT. GCT' 
GCA ATG AAG GCT TGG TTC GCC ATT* AAT ATG GAC AAQ CTC AGA GGA" GTC ACT GTG 
GAG GTT CAG AAG .TTC GAG GAT TTC TCA TGG CTG AAT GCT TCC TAT GTT CCG GTC 
CTC - AAG CAG CTG. CAA .GAC TCT GAT ACG CAA AGC TAT ^tAT TTC " TCT GGA CAC AAC 
GAT GAT 'GGG CGC ACT . CCA ATC. AAA TTC AGG AAC CCC AAG TAT CTT TCC ATG CTC 
AAC CAT CTT AGG TTC TAC ATC CCT GAA GTG TTT CCT GCG GTG AAG AAG GTG GTC 
TTT CTT GAT GAT GAT GTT GTA GTT..CA<5 AAG" GAT . CTT TCA TCT QTC- TTT TCG ATC 
GAT TTA AAC AAA AAT GTG AAC GGG GGT GTT GAG ACC TGC ATG GAG ACC TTC CAC 
CGC TAC CAC AAG TAC TTG AAC TAT TCT CAT CCT, CTC ATA CGC TCC CAC TTT GAT 
CCA GAT GCG TGT GGG TGG GCG TTT GGA ATG AAC GTC TTT GAT TTA GTT GAG TGG 
AGG AAG AGA AAT GTG ACC GGC ATA TAC CAC TAC TGG CAA GAA AAA AAC 'GTG GAC 
CGG ACC TTA TGG AAA .CTG GGA ACA CTA CCT CCA GGA CTT CTG ACA TTT TAC GGG 
TTA ACA GAG GCA CTA 6aG GCG TCC TGG* CAT ATC CTG GGA TTG GGA TAC. ACG AAT 
GTG GAT GGT CGT GTG ATA GAG ' AAA GGA GCT GTT CTT CAC TTC AAT GGG AAC TTA 
AAG CCA TGG- TTG AAG ATC GGG ATA GAG AAG TAC .AAA CCT TTG TGG GAG AGA TAC 
GTT GAT TAC ACT TCT CCT TTT ATO CAA .CAA TGC .AAT TTT. CAT 



Amino Acid Sequence of Sequence. #6: 
Genebank ID* NP_565485 
Positions' 1-536. 

JtfRRRGGDSFRRAGRRKISNVVWWVLSGIALLIiFFrilLSKAGHIEPRPSIPK^ 
PTSVARQVNDQIAIiAKAFWIAKEjSKNLQFAWDLSAQIRNSQLLLS 
YQAQQMYDSATMIMRLKASIQA^iEBpMSSVSEKSSKyGQIAAEEVPKSLYCXGV^ 
KERSRVDSKLTPNSLYHFCWSDNIIATSVVVNSTAriNSK^ 

GVTVEVQKFEDFSWXiNASWPVLKQIiQDSDTQSYYFSGHNDDGRTPIKFIWPKYLS^ 
LKKVVFLDDDVWQKDIiSSLPSIpLBIKNWGAVETCMETPHRra 
FDIlVEWRKRNVTGIYHYWQEK^^VDRmWKLGTLPPG^ 
LHFNGNLKPWriKIGIBKYKPtiWERYVDYTSPFMQQCNFH 



Sequence #7 . . <• ' ^ . 

Gene n^e: At:2g30S75 

GeneBank accession # for reference: NWL147355.1 GX:22326004 

Nucleotide sequence of Sequence #7:* 
Positions* 303t2135. * • 

ATG AAfP CAA GTT CGT CGT TGG CAG AGG ATT GTG ATC CTC TCG CTG CTA •yTG Tl*A 
TCT- GTT.TTA GOT COG ATT GTT TTC GTT TCG AAT CGG CTC AAG AGC ATC ACT TCC 
GTC GAT AGA GGA GAA TTC ATT GAA GAA TTA TCC. GAC ATT ACA GAT AAG ACC GAG 
GAT GAA CTT AGA CTT ACT GCT ATT GAA CAG GAC GAA GAA GGC TTG AAG GAG CCT 
AAA CGT ATT CTG CAG GAT CGA GAT TTT AAT TCT GTG GTT TTG TCA AAT TCC TCT 
GAT AAA AGT AAT GAT ACT GTG QAG TCT AAT GAG GGA GAC CAA AAA AAQ TTT CTC. 
TCA GAA GTT GAT AAG GGA AAT AAT CAC. AAA- CCA AAG GAG GAA CAA GCA GTT TCA 
CAG- AAA ACC ACA GTA AGC TCG AAT.QCG GAG GTG AAA ATT TCA- GCA AGA GAT ATT 
CAA CTT AAT CA*r AAA ACG GAA TTC CGA CCC CCT TCA AGT AAG AGT GAA AAG AAT 
. ACA AGG GTT CAA CTT GAA AGA GCA ACA GAT GAG AGG GTA AAG GAG ATC AGA GAC 
AAA ATT ATC CAA GCG AAA GCC TAT CTG AAT TTG' GCC CTA CCT GGG AAT AAC TCC 
' CAA ATC GTA AAG GAG' TTG AiGA GTT CGA ACG AAA GAG CTG GAA CGG* GCT ACT GGT- 
GAT ACT ACC AAiS GAT AAA TAT TTG CCA AAG AGC TCT CCT AAC AGA TTG A^G GCC 
ATG GAA. GTT GCG TTA TAG AAG GTC AGC CGT GCC TTT CAC AAG TGC CGT GCC ATT 
GCT ACC AAA CTC CAA GCC AIG ACT TAT AAA ACC GAA- GAA CAA GCT CdG GCG CAG 
AAG AAA CAA GCA GCA TAT TTA ATG CAG CTT GCA GQA AGG ACT' ACC CCA AAA GGG 
CTT CAT TGT CTC TPA ATG CGG TTG ACA ACA* GAA TAt TTT ACC CTG GAT CAC GAA 
•AAA AGG CAG CTT TTG CAA CAA AGT TAT AAT GAT CCT; GAT CTC TAC CAT TAC GTA 
GTC TTC TGT GAC AAT GTT TTG GCC TCT TCG GTT .GTT C^TT AAC TCT ACA ATC TCC 
TCA TCA AAG GAA CCG GAT AAA ATA GTA TTC CAT GTG GTG ACA GAT TCA CTC AAT 
TAC CCA GCA ATC TCA ATG TGG TTT TTA CTA AAC CCA AGT GGC AGA GCT TCA ATC 
.CAA ATG CTA AAC ATT GAT GAA ATG AAT GTC CTG CCA. TTG TAC CAT GCT GAA TTG 
CTG ATG AAG CAA AAT TCA AGT GAC CCA AGA ATC. ATT TCA GCG CTC AAC CAT GCA 
GGC TTC TAT CTC CCA GAT ATC TTC CCA 6GT CTA AAC AAG ATC GTA CTC TTC GAT 
CAT GAT GTA GTA GTG CAA AGG GAT. CTA ACT AGA CTG TGG AGC CTT GAT ATG ACG 
GGG AAA GTT GTT GGA GCT GTA GAG ACT TGT CTT GAA GGT GAT CCT TCA TAT CGT 
TCG ATG GAC TCA TTC ATT AAT TTC TCA GAT GCA TGG GTT TCT CAG AAA TTT GAT • 
CCC AAG GCT TGC ACT TGG GCA TTC GGG ATG AAT CTA TTT GAT CTC GAA GAA TGG 
AGA AGA CAG GAG TTG ACT TCT GTA TAC CTG AAA TAC TTC QAC CTG GGA GTA AAA 
GGA CAT . CTG TGG AAA GCA GGG GGA TTG CCA GTA. GGT TGG TTG ACT TTT TTC GGG . 
CAA ACG TTT CCG TTG GAA AAG AGA TGG AAC GTG GGT GGG TTA GGT CAC GAA TCA 
GGA CTC AGG GCA AGC GAC ATC GAA CAA GCA GCG- GTT ATA CAC TAC GAC GGG ATC 
ATG AAA CCA TGG .CTG GAC ATC GGT ATA GAC AAG TAC AAG C6C TAC TGG AAC ATA 
CAT GTA CCT TAC CAT CAC CCT CAC TTA CAA CGG TGC AAC ATT CAC GAT TGA 



Amino Acid Sequence of Sequence #7: • 
Genebank ID# NP_671901 
Positions 1-610. 

mQVRRWQRILILSLLIiLSVLAFIVFVSNRI/KSITSVDRGEFIEELSDITDKTEDELRLTAIEQDETC 

PKRILQDRDFNSWLSNSSi:>KSNDTVQSNEGDQKNPLSEVDKGNN^ 

DIQLNHKTEFRPPSSKSEKNTRVQLERATDERVKEIRDKIIQAKAYLNLALP 

ATGDTTKDKYLPKSSPNRIiKAMEVALYKVSRAFHNCPAIATKLQ 

TPKGI.HCLSMRLTTEYFTtlDHEKRQLLQQSYNDPDLYHYVVFSDN^nLASSV^ 

TDSl^PAiSMWPIJIllJPSGiV^IQIIlNIDEM^n/LPLYHAELL^^ 

KIVLFDipVWQRDLTltljWSLDmK3KWGAVETCI.B6DPSYRSMDSFINFSnAWVSQK 

LFDLEEWRRQEtiTSVYLKYFDLGVKGHLWKAGGLPVGWLTFFGQT^ 

AVIHYDGIMraWLDIGiPKYKRYWNIHVPYHHPHLQRCNiHD. 



Sequence #8 

Gene name: At2g46480 • . 

GeneBank accession # for reference: N)yL-130212 .2 ♦ GIi22326493 * 

BTucXeotide sequence of Sequence #8: 
.Positions 1-1587. . ' ' . ' - 

*MG ACT GAT OCT. TGT * TGT TTG AAG GGA AAC GAG GAG AAA ATG GTT COT CGT TTT • 
GGT CAT GGA ACC TGG ATA GGA AAA GCA . TTT AAT GAT ACA.CCA GAG ATG TTG CAT 
' GAA AGG AGT CTG AGA CAG GAA AAA AGA TTG jGAA AGG GCT AAT GAG CTG ATG AAT 
GAT GAT AGT CTG CAA AAG CTT GAG* ACG GCA GCC ATG GCA CGT TCC AGA TCT GTC 
GAT*TCT.,GCA CCA CTA GGA AAC TAC ACC ATT TGG AAA AAT GAA TAC CGG AGG GGC 
AAG AGT TTT X^Ml GAT ATG TTA CGT TTG ATG CAA GAT GAA ATG ATC' ATG' GCA CG^ 
GTT TAC AGT- GGA CTT GCA AAG TTT ACA AAQ AAT CTC GCC TTG CAC. CAA GAG ATA 
GAA ACA CAA CTA ATG AAA CTA GCT TGG GAG GAA GAA TCT ACT GAT ATT GAT CAG 
GAG CAG AGA GTA CTT GAC AGT ATA AGA . GAC At<3 G6A CAA ATA CTG GCT AGA GCA 
CAC GAG CAG CTA TAT GAA TGC AAG TTG GTG ACA AAT. AAG TTG AGA GCA ATG CTA * 
CAA ACA' GTT GAA GAT GAA CTC GAA AAC GAG CAG ACT- TAT ATA ACG TTC TTG AGT 
CAG CTA GCT TCC AAG GCA CTA CCA GAT GCT ATC CAC TGC TTG ACC ATG CGC- TTG 
AAT CTA GAG TAT CAT CTC CTG CCT TTA CCG ATG AGA AAT TTT CCA AGG AGG GAG 
AAT TTG GAG AAT CCA AAA CTT TAC CAC TAC GCT CTC TTC TCT GAT- AAT GTA CTG ' • 
GCT GCA' TCA GTT GTT GTC AAC TCC ACA GTC ATG AAT GCA CAG GAT CCT TCA AGG 
CAT GTT TTC* CAC CT* GTG ACT GAT AAG CTC AAC TTT GGA GCA ATG * AGT ATG TGG ' 
TTT CTG TTG AAC CCT CCT GGA GAA GCG- ACC ATC CAT GTC CAA AGG TTT GAA GAT 
TTT ACT TGG CTC AAC TCA TCT TAC TCT CCA GTT TTG AGT GAG CTC GAG. TCA "GCA 
GCT ATG AAG AAG TTC TAC .TTC AAG ACA GCG AGG TCT GAA TCA GTT GAA TCA GGC 
'.TCA GAA AAC CTC AAG TAC CGG TAC CCG* AAA TAC ATG TCA ATG CTT AAC CAC CTG 
AGG TTQ ' TAC ATC CCT AGG ATC TTC CCA AAG TTG. GAG AAA ATC TTG TTT GXT GAC 
GAT GAT GTG GTT GTT CAG AAG GAT TTA ACT CCC CTA TGG TCC ATT GAT CTT AAA 
G6G AAA GTG AAT GAA AAC .TTT GAT CCC AAG TTC .TGC GGA TGG GCT TAT GG6 ATG 
AAC ATC TTC GAC- CTG AAA GAA TGG AAG AAG AAC AAC ATT ACA GAA ACT. TAT CAC 
TTT TGG CAA AAC CTG AAC GAA AAC CGG ACT CTA TGG AAA CTA GGA ACA TTG CCA 
CCA GGG CTC ATA ACG TTC TAC AAT CTG" ACA CAA CGA CTT CAG AGA AAA TGG CAC 
TTA CTT "GGA CTG GGT TAT GAT AAA GGA ATC GAT GTC AAG AAG ATT GAA AGA TCA 
GCT GTT ATA CAT TAC AAT GGA CAC ATG AAA CCA TGG ACA GAG ATG GGG ATA AGC 
AAG TAT CAG CCA TAT TGG ACG AAG TAC ACC AAT TTT GAC CAT CCT TAC ATC TTT' 
ACT TGC AGG CTG TTt GAG TGA ' ' * * . • 



Aiqino Ac±d Sequence of Sequence #8: ' . 
Genebank ID# NP_182171 . | * 

Positions 1-528.- • * • . 

MTDACC3:.KGNEDKMVPRFGHGTWIGKAFNDTPEM^ 
VDSAPl^GKYTIWKNEYiOlGKSFEDMIJUiMQDQIIMARVYSGr^ 
DQEQRVia)SIRDMGQiriARAHEQI.YECKlOT!NKI^^ 
MR3LNLEYHLLPLPMRNFPRRENIiENPKr.YHY^^ 

MSMWFLLNPPGEATIHVQRFEDFTWLNSSYSPVLSQLESAAMKKFYFKTARS 
MTJIHLRFYIPRIFPKLEKILFVDDDVWQKDLTPLWSIDLKGKVNENFDPKFCGW 
ITETYHFWQJSnt-NENRTIiWKLGTtiPPGLITFyNLTQPLQRiafra 
WTBMGISKrQPYWTKYTNFDilPYIFTCRLPE 



Sequence #9 ' • 

Gene name: At3g01040 • 
GeneBank accession # for reference: ♦ NM__110969 . 1 01:18395686 

Nucleotide sequence of Sequence #9: • • 
Positions 1-1533. ' ' 

ATG AAG ATC AAA GTC GCA OCT CGT CAC ATC TCT TAG CGA ACT CTC TTC CAC ACT 
ATC TTA ATC CTC GOT TTC TTG TTA CCT TTT GTT TTC ATC CTA ACC GCT GTT GTT 
ACC CTT GAA GGT GTC AAC AAG TGC TCC TCT TTT GAT TGT TTC GdG AGG CGG' CTA 
GGA CCA CGT CTT CTT GGT AGG ATA GAT GAT TCA GAG CAG A6A COJA GTT AGA GAT^ 
TTT TAC AAA ATT CTA AAT GAA GTA A6C ACT CAA GAA ATT CCA GAT GGT TTA AAG 
CTT CCA. GAG TCT TTT* AGT CAA CTG GTT TCG GAT ATG AAG AAC AAC CAC TAT GAT 
GCT AAA ACA TTT GCC CTC GTA TTT CGA GCT ATG GTA GAG AAG TTT GAA AGG GAT 
TTA AGG GAA TCC AAA TTT GCA GAA CTC ATG AAC AAG CAC TTT GCT jSCA AGT TCA- 
ATT CCA AAA GGA ATT CAC TGT CTC TCT TTA AGA CTA ACC GAT GAA TAT TCC TCC 
AAT GCT CAT GCC CGG AGA CAG CTT CCT TCC CCG GAG CTT CTC CCT GTT CTg TCA 
GAC AAT- GCT TAC CAC CAT TTT GTT CTA GCT ACA GAT AAT ATC TTA GCT GCA TCG 
GTT CTG' GTC TCA TCT QCT GTT CAA TCA TCT TCA AAA CCC GAG. AAA ATT GTC TTC 
CAT GTT" ATC* ACA GAC AAG AAA ACC .TAT GCG GGT ATG CAT TCT TGG TTT GCA CTC 
AAT TCT .GTT GCT CCT GCG ATT GTT GAA GTG AAA AGC GTT CAT* CAG TTT GAT TfeG 
TTA ACA AGA GAG AAT GTT CCA GTT CTT GAA GCT GTG GAA AGC CAT AAC AGO* ATC 
.AGA AAT TAT TAC CAT GGG AAT CAT ATT GCT, GGT GCA AAC CTC AGC GAA ACA ACC 
CCT CGA ACA TTT GCT TGG AAA CTG CAG TCA AGA AGT CCC AAA TAC ATA TCT 'TTG 
CTC AAC CAT CTT AGA ATA TAT CTA CCA GAG CTT TTT CCG AAC TTA GAC AAG GTA! 
GTG TTC TTA GAT GAT GAT ATA GTG ATA CAG AAA GAT TTA TCT CCG CTT TGG ' GAT 
ATT GAC CTT AAC GGG AAG . GTT AAT GGA GCT GTG GAG ACT TGT CGA GGA- GAA QAC 
GTA TGG GTT ATG TCA JlAG CGT C^T AGG AAC TAC TTC- AAT TTT TCT- CAC CCG CTC. 
' ATC GCA AAG CAT TTA GAT CCC GAA GAA' TGT GCT TGG GCT TAT GGA- ATG AAT ATC 
TTT GAT^CTA CGG ACT TGG AGG AAG ACA AAT ATC AGA GAA ACG TAT CAT TCT TGG 
CTT AAA GAG AAT CTG AAG TCG AAT CTA ACA ATG TGG AAA CTT GGA ACA TTG CCT 
CCT GCT CTA ATA GCA TTT AAA . GGT CAT GTT CAG CCA ATA GAT .TCC TCT TGG CAT 
ATG CTT GGA TTA GGT TAT CAG AGC AAG' ACC AAC TTA GAA AAT GCG AAG AAA GCT 
GCA GTG ATT CAT - TAC AAT GGC CAA TCA AAG CCG TGG CTT GAG ATA GGT TTC GAG 
CAT dTC AGA CCA TTC TGG ACA AAA TAT GTT AAC TAC TCC AAT GAT TTC ATT AAG 
AAT TGT. CAT ATC TTG GAA TAG 



Amino Acid Sequence of Sequence #9:. ' . . 
Genebank ID» NP_186753 

Positions 1-510. • . 

MKIKV^UVRHISYRTLFHTILILAFLIiPFVFILTAVV^ 

DFYKILNEVSTQEI PTCLKLPESF^QLV SDMKNNHYDAKTFALVFRAMVEK^ 
SSIPK6IHCLSlJRLTOEYSSNAI£^RRQLPSP^IltlPVLSDNAYHI^^ 
IVFHVITDKKTYAGMHSWFArJbJgVAPAiyEVKSVHQFDWLTRENW 
SETTPRTFASKLQSRSPKYISnLiraLRIYLPEliFPNriDKWFL^ 
CRGEDVWVMSKRLm^YFNFSHPLIAKHLDPBECAWAYGMNIFDIiRTWR^ 
KLGTItPPALlAFKGHVQPIDSSWHMLGLGYQSKTNI^ 
YSNDFIKNCHILE 



Sequence #10 

Gene name: At3gO23 50 
GeneBank accessio.n # for reference: 
' ' * . . • 

Nucledtide sequence of Sequence #10: 
Positions 243-1928, 

ATG GCG GTG GCC TTC CGT ^GGA GGC CGG GGA. GGC GTQ GGA TCC GGC CAA TCT ACC 
GGA CTT CGT AGT TTC TTC TCC TAG CGG ATC TTT ATC TCC GCT TTG TTC TCT TTT 
CTC TTC CTC GCC ACT TTC TCC GTC GTT CTT AAC TCC TCT CGT CAT CA6 CCT CAT* 
. CAG GAT, CAT ACA TTG CCG AGT ATG GGC .AAC GCA TAT ATG CA^ AG6 ACG TTT TTG 
GCT TTG CAA TCG.GAT CCA TTG AAA ACT AGG TTG GAT CTG ATA CAC AAG CZIA GCC 
ATT GAT CAT. TTG ACA CTG GTG AAT GCG TAT GCT GQT TAC^ GCT AGG AAG CTA AAG 
CTT GAT . GCT TCT AAG. CAG- CTT AAG CTC TTC. GAA GAT TTG GCT ATC AAC TTC TCG 
GAT TTG CAG TCG AAA CCT GGT TTG AAA TCT GCT GTG TCT GAT AA5? GGT AAT GCT 
CTT GAG. GAG GAT TCG TTT AGG CAG CTT GAG AAA GAA GTG AAG GAT AAG iSTG AAG 
ACA GCG AGG ATG ATG ATC GTT. GAG TCT AAA GAG AGT TAT GAT ACA CAG CTT AAA 
ATC CAG AAG TTG AAA GAT ACA ATC TTT. GCT GTC CAA GAA CAG TTG ACA AAG GCT 
AAG AAA, AAC GGT GCG GTT GCT *AG.C'TTG ATT TCA GCC AAG TCG GTT CCT AAA AGl? 
CTT CAT TGT TTG GCC ATG AQG- CTT GTA GGA GAG ,AGG ATC TCT AAT CCT GAG AAG 
TAC AAG GAT GCT CCA CCT GAC CCA GCC- .GCA GAG GAT CCA- ACT CTT -TAC CAC TAT 
GCG ATT TTC- TCT GAT AAT GTC ATT; GCT GTG TCT GTT GTG . GTG AGA TCG GTT GTG 
ATG AAC GCT GAG GAG CCA TGG AAG CAT. GTC TTC CAT GTG GTG* ACA GAT CGG ATG • 
AAT CTC GCA GCC ATG . AAG' GTG TGG TTT AAG .ATG CGT. CCT TTG GAC CGT GGT GCC 
CAT GTT GAG ATT AAA TCC GTG GAG. GAT TTC AAG TTC" TTA AAC TCT TCC TAT GCG 
CCG GTC. TTG AGG CAG CTT. GAG TCT GCC AAG TTG CAG AAG TTT TAC TTT GAG AAT 
CAA GCT -GAG AAC GCA ACT AAA GAT TCA CAT AAC CTC AAG TTC AAG AAC CCC AAG 
TAT CTC .TCG ATG TTG AAC CAT CTC- AGA TTT TAC TTA CCA GAG ATG TAT CCG AAG- 
CTG AAT AAG ATT TTG TTC TTG^GAC GAT GAT GTT GTG GTG CAG AAA GAC GTG ACT 
QQrp rjyj.^ rj^QQ AAC TTG GAT GGC AAG GTG AAT GGA GCC GTT* GAG ACA TGT 

TTT GGT TCT TTT CAT QGA TAT GGT* CAA TAC TTA AAC TTC T?:T' CAT CCT TTG ATC . 
AAA GAG AAC TTT AAC CCC AGT GCC TGT GCT TGG GCC TTT GGA ATG AAC ATA TTC 
.GAT CTC AAT GCC TGG AGA CGQ GAG AAG TGC ACC GAT CAA TAC CAT TAC TGG CAG 
AAC CTG AAT GAA- GAC AGA ACT CTC TGG AAA' TTG GGA ACT CTA CCT CQG GGA TTG 
ATC ACA TTC TAT" TCA AAG ACG AAA TCA TTG GAC AAA' TCA TGG CAT 6TA CTT GGG * 
TTA GGC TAT -AAC- CCG GGA GTG AGG ATG GAC GAA ATC AGA AAT GCA GGA GTG ATT . 
CAT TAC AAT GGA AAC ATG .AAA* CCG TGG CTA GAC ATT GQG ATG AAC CAA TAC AAG 
TCT CTC TGG ACT AAA TAT GTT GAT AAC- GAA ATG GAG TTT GTG CAG ATG TGC AAT 
TTT GGT' CTC TAA ' . ' 



AY056202 
61:1581032$ 



Amino 'Acid Sequence of Sequence #10: 
Genebank ID#. NP_566176 
jpositions 1-561 ♦ 

MAVAFRGGRGGVGSGQSTGLRSFFSYRIFIsiu^SFLFLATFSVVLNSSRHQPHQbHT^ 
NALEEDSFRQLEKEVKDKVK^ARMMIVESKESYiyrQLKIQKL^^ 

PKSLHmAMRLVGroiSNPEKYKDAPPDPAAEDPTLYHYAiFSDWIAVSVWRSVVM^ 

TDimWIJUUlKVWFKMRPLbRGAH^ 

FKNPKYLSMLNHLRFYLPEMYPKLNKILFLDDD^ 

PSHPXiIKENFNPSACAWAFGMNIFDLNAWRREKCTDQyHYWQNLNEDRTLWKLGT^ 
KSWHVCX^^YNPGVSMDBXRNAGVXHYNGNMkPWLDIAHE?^ . 



Sequence #11 • . 
Gene name: at3g25140 

GeneBank accession # for reference: NI1_113418.2 GX:22331320" 

Nucleotlde sequence of Segpience #11: 
Positipns 38-1717. 

ATC GCT 'AAT CAC CAC CGA CTT TTA .CGC GGC GGC GGA TCT CCG GCC ATA ATC G<3T 

• GGC AGA ATC ACA CTC ACA GCT TTC GCT TCC ACT ATC GCA CTC TTC CTC TTC AQT 
CTC TCC TTC TTC TTC GCT TCA GAT TCT TIAC GAT TCT CCT GAT CTC CTT CTT CCC 
GGT GTT GAG TAC .TCT AAT GGA GTC GGA TCT AGA AGA TCQ ATG TTG GAT ATC AAA 

. TCG GAT .CCG CTT AAG CCA .CGG TTG ATT CAG ATC CGG AAA CAA GCT 'GAT GAT CAT 
.CGG TCA TTA GCA- TTA GCT TAT GCT TCT Tkc GCG A6a AAG CTT AAG CTC GAG AAT 
TCG- AAA CTC' GTC AGG ATC TTC- GCT GAT CTT TCG AGG AAT TAC ACG GAT CTG ATT 
AAC AAA CCG ACG TAT'CGA GCT TTG TAT GAT TCT. GAT GGA GCC TCG ATT GAA GAA ' 
TCT GTG CTT AGG CAA TTT GAG AAA GAA GTT AAG GAA CGG ATT AAA ATG ACT CGT 
CAA GTG ATT GCT GAA GCT - AAA GAG TCT TTT GAT AAT CAG TTG AAG ATT. CAQ AAG 
CTG AAA GAT ACG ATT TTC GCT GTT AAC GAA CAQ TTA ACT AAT GCT AAG AAG CAA 
GGT GCG TTT TCG AGT TTG ATC GCT GCG AAA TCG ATT CCG AAA GGA TTG CAT TGT ■ 
CTT GCT^ATG AGG CffG ATG GAA GAG AGG ..ATT GCT CAC CCT- GAG AAG TAT ACT GAT 

• GAA GGG^AAA GAT AGA GCG* CGG .GAG CTC GAG GAT - CCG AAT CTT TAC CAT TAC GCT 
ATA TTT TCG GAT AAT GTG ATT GCG GCT TOG GTG GTT GTG AAC TCT GCT GTG AAG * • 
AAT GCT AAG. GAG . CCG TGG AAG CAT GTT TTT CAC GTT GTG ACT GAT AAG ATG AAT- 
CTT, GGA GCT ATG CAG GTT ATG TTO?- AAA CTG AAiG GAG. TAT AAA GGA GCT CAT GTA 
GAA GTT AAA GCT GTT GAG GAT TAT ACG TTT TTG AAC TCT TCG TAT GTG CCT GTG 
TTG AAG CAG TTA GAA TCT GCG AAT CTT CAG AAG - TTT TAT TTC GAG AAT AAG CTC 
GAG AAT GCG ACG AAA GAT. ACC ACG AAT ATG AAG TTC AGG AAC CCC AAG TAT TTA 
TCT ATA TTG AAT CAC TTG AGO TTT TAT TTA CCC GAG ATG TAC CCG AAA CPA <?AT 
AGG ATA- CTG TTT TTG GAC GAT GAT GTG GTT GTG CAG AAG GAT TTA ACG G(?T CTG' 
TGG GAG ATT GAT ATG GAT GGG* AAA GTG AAT GGA GCT GTA GAG ACT TGT TTT GGG 
TCG TTT CAT CGG.TAC GCT CAA TAC ATG AAT TTC TCA" CAT CCT TTG ATC AAA GAG .'^ 
AAG TTT AAT CCC AAA GCA TGT GCG TGG GCG TAT. GGA ATG AAC TTC TTT. GAT CTT " 
GAT GCT TGG AGA AGA GAG AAG TGC ACA GAA GAA TAT CAC TAC TGG CAA AAT CTG 
AAC GAG AAC AGG GCT CTA TGG AAA . CTG GGG ACG TTA CCA CCG GGA CTG ATC ACC 
TTT TAC TCA ACC ACA AAG CCG CTG GAC AAA TCA TGG CAT GTG CTT GGG CTG GGT 
TAC AAT CCG AGC ATT AGC ATG GAT GAG ATC CGC AAC GCT GCA GTG GTA CAC TTC • 
AAC GGT AAC ATG -AAG CCA TGG CTT GAC ATA GCT ATG AAC CAG' TTT CGA qcA CTT 
TGG ACC AAA CAC GTC GAC TAT. GAC CTC GAG TTT GTT CAG GCT TGC AAT TTT GGC 
CTC TGA • 



Amino Acid Sequence of Sequence #11 r 
G^nebank ID# NP_1 89150 
Positions 1-559-. 

MANHHRIirj^GGGSPAIIGGRITLTAFASTIALFLFTLSFFFASDSNDSPDLlilj 

KSDPIiKPRLXQIRK;QADDHRSIArjAYASYARKLKLENSKI,WIFApLSRNYTO 

EESVLRQPEKEVKERIKMTRQVIAEAKESFDNQLKIQKLKDTIFAVNEQLTN^ 

LHCIiAMRLMEERIAHPEKYTDEGKDRPRELEDPNtYHYMFSDWIAAS 

DKM^^;lGAMQVMFKLKEYKGAHVEVKAVEDYTFLNSSYV^ 

NPKYLSILNHLRFYLPEMyPKLHRILFIiDDDVWQKDLTGLWEIDMDGI^ 

HPLIKEKFNPKACAWAYGMNFFDLDAWRREKCTEEYHYWQNLN?^^ 

WHVIjGLGOTPSISMDEIRNAAVVHFNGNMKPWLDIAMNQFRPLWTKH\mTO 



• SQQpience #12 

Gene name: At3g58790 • 

•GeneBank accession # for reference: Blll.115741.2 GI: 223 3185 6 

Nuqleotide sequence of Sequence #12 : 
Positions 353-1975, 

ATG AAG TTT TAG ATA TCA GCG ACG GGG ATT AAG AAG GTT ACG ATA TCA AMT CCC 
GGC. GTC GGA ATC GGT AAA GGA AGC GGA GGA TGT GCG GCT GCA GCG GCG GCC TTA 
GO A GCG CGG AGA .TTC TCT AGT CGC ACG TTG TTA CTG TTG CTG CTG CTG CIC GCT 
ATC GTC CTC CCT TTT ATC TTC GTC AGG TTC GCG TTT "CTC GTC CTC GAA TCJT GCC 
TCC GTT TGC.GAT TCA CCA CTC GAT TQC ATG GGA CTC AGA CTT TTC CGT GGC GGC 
GAG ACA TGT CTG AAA ATT GGG GAA GAG TTG ACA CGG GGT CTA GTG GAA GPG ACG 
AGA GAT GAT.CAG GAC GTT AAT GGA AGA 'GGA ACG AAG- GGA TCA TTG GAG TCa TTC 

* GAC GAC CTT GTT AAG' GAG ATG ACG. TTA. fkAA CGC CGT GAC ATA AGG GCG TTE* GCT 
TCC GTG. ACT AAG AAG ATG CTG TTG CAG ATG GAA. CGT AAA GTC CAA TCA GCG AAA 
CAT CAT GAG TTA GTG. TAG TdG CAT. TTA GCC TCT CAC GGT ATT CCT AAA jftOC CTC 
CAT TGC .CTT TCC CTC. AGA TTA ACT GAA GAG TAG TCT GTA AAT GCA AtG GCff' CGA 
ATG CGT TTG CCT CCG CCT GAG TGC GTA TCA CGT CTG ACG GAG CCA TCT THP CAT 
CAT ATT GTG CTC CTG ACT GAC AAT GTG GTT GCT GCC TGT GTG GTC ATA TO5 TCT 
ACT GTA GAA AAG GCT GTG AAT CCG GAG AAG TTT GTG TTT CAT ATT GTT AO: GAT 
AAG AAA AGC TAT AdC CCT ATG GAT GCT TG6 TTT GGT ATG AAG TCT GCT TCSL TCA 
CCA GTT GTT GAA GTA AAG' GGA- CTT CAT GAG TAT CSAT TGC CCT CAA GAA CTB AAG 
TTC AAA GTT AGA GAG ATG CTG GAG ATT GAC CGC TTA ATT TGG AGA GGA CflflP TAT 
CAA AAT TTG AAA GAC TCT GAT TTT- AGT TTT GTT GAG GGT ACT CAT GAG CftS TCC 
TTG CAA GCT CTA AAT CCT AGC TGC CTT GCC CTT TTG AAG CAT CTT GGG ATP TAG 
ATT CCC AAG CTT TTT CCA GAT" CTC AAG AAG ATA GTG TTG TTG GAT GAT GST GTA 
GTA GTA GAG AGG GAT CTT TGG TGT TTA TQG GAA ACG GAT CTC AAC;G6T 'aA& GTT 
GTT GGT GCT GTC GTT .GAT TGG TGG -.TGC GGA GAC AAG TGT TGC. GCC (3GA Afi& AAA 
TAG AAA .GAC TA* TTG* AAG .TTG TCA CAT CGT TTG ATC TCA TCA AAC TTA GTT CAA 
GAA GAC TGT GCT TGG CTT TGT GGT ATG AAT GTC TTT GAT CTC AAA GCC TGB AGA 
CAA ACC AAT ATT ACT GAA GCT TAG TCT AGA TGG CTA AGA CTC AGT GTT AGS TCA 
GGA CTA CAA TTA TGG* CAA CCA GGG GCT TTfA CCA CCG ACA TTA CTT GCT TTC AAA ' 
GGA CTT ACA CAG TCT CTT GAA CCA TCA TGG CAC GTC GCT GGA CTA GGT TCP CGA 
TCC GTA AAA TCC CCT. CAA GAG ATT CTG AAA TCT GCT TGG GTT TTA' CAT TTC AGC 
GGT CCA GCA AAA GGG TGG CTA GAG ATC AGT AAC qCT GAG GTA CGA TCT CTP TGG 
TAT AGA TAG GTA AAT TCC TGG GAC ATG TTC GTT AGA AAA TGC MA ATC ATG AAC 
TdA • • • • • .• 



Amino Acid Sequence of Sequence #12: 
Genebank ID# NP_191438 
Positions 1-540., 

.;MKFYISATGIKKVTi;5NPGVGIGKGSGGCAAAAAAIJ^ARRFSSRT 
ASVCDSPIiDCMGLRljFRGGDTSIiKIG^IiTflALVBETTDHQDVNGRGTK^ 
FASVTKKMLLQMBRKVQSAiaiHELVyWHrASHGIPKSLHCL.SLRLTE^ 
SFHHIVLIlTDNVLAASWISSTVQNAVNPEKFVFHIVTDKKTYTPMHAWFAINSASSPV^^ 
QEVNFKVI^iyn^DIHiailWRRHYQI^KDSDPSFVEGTHEQSLQAIiNPSCtJ^ 
LDDDVWQSDLSSI*WETDLNGKWGAWDSWCGDNCCPGRKYB3)YFNFSHPLISSI^ 

•DliKAWRQTNITEAYSTWLRLSVRSGLQLWQPGALPPTLIAFKGLTO 
A^VLHFSGPAKPWLBISNPEVRSLWYRYVNSSDIFVRKCKIMBr 



' Sequence #13 

. Gene name : At:4g48270 . 
GeheBank accession # for ' reference: N^_119989.1 GX: 18420208 

Nucleotide sequence of Sequence #13: 
Positions 1-1977, 

ATG TCG ACQ ATT TGC AGC CAC AGA GAA CTT AAA GCT TAT CGT CCG CTG CM GAE 
AAT AAT CTA CAd GAG GTG TAT GCT TCC TCA GCT GCT 6CA GTG CAC TAC GffiT CCR 
GAT CTG AAA.CTG' TTA TCT CAG GAT GTG.^lAC ATA GTT GCG ACA TAC AGT. GSC CXS 
TAG GGC AAT ATA CGC CTT GGT AGG GTG AAA ATG GGG GAT CTT TCA. GCT. TCT TGC 
GTT TTG GAG AAT QCT GCC TAT CAA GTT AGC CGC AAA - ACA AAA GGT TCG CJftS CTSt ' 
GTT ATA CCA CGG GAT TCA TTT CAA AAT GAT ACT GGA ATG. GAA GAT AAT GCA AGC 
CAT TCT ACA ACT AAT CAG ACT GAT GAA AGC GAA AAT CAG TTT CCA AAC GTO GASr * 
TTT GCA AGC CCA GCA AAA CTG AAG CGG CAQ ATT TTA CGT- CAG GAA AGG AGA GOT 
CAA CGA ACT TTA GAG CTG ATC CGA CAA GAA AAG GAA ACT GAT GAG CAG AIG CAft 
GAA GCA GCC ATT CAG AAG TCA ATG AGC TTT GAA AAC TCA GTC.A'i'A QGG AAA - TAC • 
AGT ATA TGG AGG AGA GAC TAT GAG AGO CCA AAT GCT GAT GCT ATC TTG AS© 
ATG AGA GAC CAG ATC ATA ATG GCA AAA GCA TAT GCA AAT ATT GCC AAA TC^ AAfi: 
AAT GTA ACC AAT CTG TAC GTT TTC TTG ATG CAG CAQ TGT GGA GAA AAT AAft .CGP 
GTT ATA GGT AAA GCA ACC TCT GAT GCT GAC CTT CCT TCA AGC GCT CTT GftO?. CAa ' 
GCA AAA GCC ATG GGC CAT . GCA CTC TCT CTT GCA AAA GAC GAG TTA TAT GAC TGC 
CAT GAA CTT GCA AAA AAG TTC .CGG GCjC ATC CTT GAG TCC ACT GAA CGC AAA Gm 
GAT GGA CTG AAG AAA AAG GGA ACC TTC TTA ATT CAG GTA * GCT GCQ AAA ACSk TTIf 
CCC AAG CCA TTG CAT TGC CTG AGT "CTG CAG CTA GCG 6CA GAC TAT TTT AlftP Cm ' 
GGT TTC AAT GAA GAG GAT GCA GTG AAA GAG GAT GTC^AGT 'CAA AAG. AA6 CTT GA& 
GAT CCT TCG CTC TAT. CAC TAT GCG ATC TTT TCG GAT AAC GTT" CTG" GCT ACa TCA 
iSTG 'GTG GTG AAC TCC ACT GTC TTG AAT GCA AAG GAA CCG CAG AGQ CAT GTXS' TTC 
CAT ATA GTA ACT GAC AAA CTG AAT TTT GGT GCA ATG .AAG ATG TGG TTT CGC ATC 
AAT GCT CCT . GCT GAT GCG ACG ATT CAA GTT. GAA AAC ATA AAT GAT TTC AAG TGG 
CTG AAC TCC TCT TAC TGC TCT GTT CTA CGG CAG jCTT GAA TCT GCA AGG: CTG AAA 
GAA TAC TAT TTC AAA GCA AAT CAT CCT TCA TCA ATC TCA GCT GGC GCA GASir AAT 
CTA AAG TAC CGC AAC CCA' AAG TAT CTA TCG ATG. CTG AAT CAT CTC AGA TTC t'AC 
CTT CCT GAG GTT TAT CCG AAG CTG GAG AAG ATT CTG. TTT. CTA GAC GAT GAC ATT 
GTG GTG CAG AAG GAC CTG GCA CCA CTA TGG GAA ATA GAC ATG CAA GGA AAA GTC 
AAT •GGT .GCG GTG GAG ACG TGC AAG GAG AGC TTC* CAC .A(5A TTT GAC AAG TAC CTC 
AAC, TTC. TCA AAT CCA AAG ATT TCA GAG AAT TTT. GAC QCT QGT GCT TGT GGG TGG *** 
GCA .TTT GGG ATG AAT ATG TTT GAC CTG AAA GAG TGG AGG -AAA CGG AAC ATT ACA 
GGWUlJUUtnJinJtJUU5»w[S^*'yy;^ GAC TTG AAT GAA GAC AGA ACA CTG TGG AAG CTG 
GGA TCG. TTG CCA CCG GGG CTG ATA ACA TTT TAC AAC CTG.. ACG TAT GCA ATG GAT 
AgG AGC TGG CAC GTA CTA GGG CTG • GGA TAT GAC" CCA GCG CTA AAC CAA ACA GCA 
ATA GAG AAT GCA GCQ GTA GTG CAT TAC* AAT jGfGG AAC TAC AAG CGA TGG CTG GGT ' 
TTA GCA TTC GCC ^AG TAC AAA CCG TAC TGG TCC AAG TAC GTT. GAG TAC GAC AAC 
CCT TAT* CTC CGA CGG TGC GAC ATC AAT GAA TGA 



Amino Acid Sequence of SequLence #13: 
Genebank ID# NP_19S540 
Positions 1-658- 

MSTICSHRELKAYRPLQDNNLQE\^ASSAAAVHYDPDLldliSQDVNlVATYSDHY 
WVLENPAYQVSRKTKGSQLVIPRDSFQNPTGMEDNASHSTO 

RGQRTLELXRQEKETOEQMQEAAIQKSMSFENSVXGKYsbaRRDYESPNAbAXLKIjMR 
KSKlSnmjLVWLMQQgGENKRVIGKATSDADLPSSALDQAKAMGHAL 
.ERKVDGLKKkGtFLIQIJUVKTFPKPLHCLSLQLAADYFII.GPNEEDAVK'Ep 
VIATSVVVN31^NaKEPQRHVFHIWDid:iNFGAMK^^ 
ESARtiKEYYFKAiraPSSISAGAPNIiKYRNPKYLSMLNHLRFYIiPEVYPK^ 
IDMQGKVNGAVETOaBSFHRFDKYLNFSNPKJSEkFDAGACQWAFGMNlMFDrjK^ 
BDRTIiWKLGSLPPGLITFYNLTYAMDRSWHVLGLGYDPAIJ^ 
WSKYVEYDNPYLRRCDINE 



Sequence #14 ' • -. 

Gekie n^e: At5gl5470 • * * , • 

GeneBank accession # for reference:. * AY056202 . ' ' 

• . . • GIrl5810326 

Nucleotide sequence of Sequence #14:" .... 
Positions 1-1719. 

ATG GGG TCT GGG AGA AGA CCA CAA GGA AGA AGA ATA GCT ATA AGG AAC GAA iSSCA 
GAG *ACG GAA TTG' AGA TCG AGA ATC GGA GAG AAA TiJG GGA TCG GAG AAG CGG fiAT 
CTG ATC GGA TCT GAT GTT GGG ATG GAG CTT CAC ATA Tdc*. CCG AGT. ATG AGA fiGC 
ATT ACG ATT TCG AGC AGC AAT GAG TTT ATT GAC TTG ATG AAG ATC -'AAG' GTQ 6CA • 
GCT CGT CAC ATC TCT TAC CGA ACT CTC TTC CAC ACC ATC TTA ATC CTC GCT l?rc 
TTG TTG CCT TTT GTT TTC ATT CTC ACC QCT GTT GTT * ACC CTT GAG dcT GTC SAC 
AAA TGC TCC TCC ATT GGG AGG CGG . ATA GGT CCA CGT CTT CTT GGT AGG QTA SVT 
GAT TCA GAG AGA QTA GCT- AGA GAC TTT .TAT AAA ATT CTA .AAC GAA GTA AGC SCT 
CAA GAA ATT CCA GAT GGT TTG AAG CTT CCA AAT TCT JTT AGT CAA CTT GTT TB?C 
GAT. ATG AAG AA* AAC CAC TAT GAT GCA AAA AC& TTT* GCT CTT.GTQ CTG CGA jBSC 
ATG ATG GAG AAG TTT GAA CGT GAT ATG AGG GAA TCG AAA. TTT GCA GAA CTT BITG. 
AAC AAG CAC TTT GCA GCA "AGT TCC ATT CCC AAA GGC ATT CAT TGT CTG TCT CTA 
AGA dTG ACA GAT GAA TAT TCC TCC AAT GCT CAT GCT CGA AGA CAG CTT CCT 9K!A 
CCA GAG TTT CTC CCT .GTT CTT TCA GAT AAT GCT TAC CAC • CAC TTT ATT TTG TCC, 
ACG GAC AAT ATT. TTG GCT GCC TCA GTT GTG GTC TCA TCC GCT GTT CAG TCA VtT 
TCA AAA CCC GAG AAA ATT GTC TTT CAC ATC ATT ACA GAC AAG AAA ACC TAT ©CG* . 
GGT ATG CAT TCA TGG TTT GCG CTT AAT TCT QTT GGA CCA. GCA . ATT GTT GAG GTT • 
AAA GOT GTT CAT CAG TTT GAC TGG TTG .ACG AGA GAG AAT- GTT CCG GTT TTQ CSA 
GCT GTG GAA AGC CAT AAT GGT GTC AGG GAC' TAT TAT CAT GGG AAT CAT GTC CCT 
<3GG GCA AAC CTC ACC GAA ACA ACT CCT CGA ACA TTT GCT TCA* AAA TTG CAG TOT 
AGA AGT CCA AAA TAC ATA TCT TTG CTC AAC CAT CTT AGA ATA .TAT ATA CCA GSG 
CTT TTC CCG AAC TTG GAC AAG GTG GTT TTC - TTA GAC GAT GAT ATA GTT GTC CSG . 
GGA GAC TTA ACT QCA CTT TjGG GAT . GTT GAC CTC GGT GGT AA<3 GTC AA.T GGG . CTA 
GTA GAG ACT -TGP AGG GGT GAA GAT GAA TGG GTG ATG TCA AAG CGT* TTA AGG ASC 
TAC TTC AAT TTC TCT CAC CCG CTC ATC GCA AAG CAT TTA GAT CCT GAA GAA TEP 
GCT TGG GCA TAT GGT ATG AAT ATC TTC GAT CTA CAA GCT TGG ^GG AAA ACA SAT 
ATC AGA GAA ACG TAT CAC TCT TGG CTT- AGA GAG AAT CTA AAG TCA AAT- CTG- ACA 
ATG- TGG- AAA CTT GGA ACC TTG CCT CCT GCT CTT ATC GCG TTC AAG GGT - d AC Gm 
CAC ATA ATA GAC TCG TCA' TGG CAT ATG CTA GGa" TTA GGC TAC CAG AGC AAQ fiCC 
' AAC ATA GAA AAT GTG AAG AAA GCA GCA GTG ATC CAC TAC AAT GGG CAG TCA dBG 
CCA TGG CTG GAG ATT GGT TTC GAG CAT CTG CGG QCA TTC TGG AGC AAA TAC Gm 
AAC TAC TCA AAT GAT TTC ATC AAG AAC TGT CAC ATA TTG GAG TAG. ... 1. 



Aaiino Acid Sequmxce of Sequence #14: 
Genebank ID# NP^X97051 
Positions 1-572. 



MRSGRRPQGRRIAIRNET5TEDRSRIGEKSGSEKRNLIGSDW^ 

AARHISYRTLFHTILIIAPLLPFA^ILTAVVTIjEGVNKCSSIGRRZGP 

STQEXPTCLKLPNSFSQLVSDMKNNHYljAKTFALVLRAMMBIDTERDm 

liSLRLTDEYSSNAHMmQLPSPEFLPVLSDNAYHHFILSTDNIIJUVSVWSSAVQSSSKPEKIVFIiim 

KTYAGldlSWFALl^SVAPAIVEVKGVHQFDWi;jTREWPVLEAVESHNGVRDYYHGNW 

SKLQSRSPKYISLLNHEiRIYIPELFPNLDKVVFr^DDDIVyQiGDLTPLV^ 

SKRIiRI^FNFSHPLllUaiLDPEECAWAYGMI^lFDLQAWRKTNXRETYHSWL 

XAFKGHVHXlDSSWHMIiGLGYQSKTNlENVKKAAVIHYNGQSKPV^ 

HIIiE 



SeQcuence #15 

Gene name: At5g54690 

GeneBank access icm # £o2f reference: AY055202 

61:15810326. . 

Nucleotide sequence of Sequence #15:, 
Positions 1-1608. 

ATG CAG TTA CAT ATA TCT* CCG AGC TTG AGA CAT GTG ACT GTG GTC AC A GGG JSSSL 
GGA TTG AGA GAG TTC ATA AAA GTT AAG GTT GGT TOT AGA AGA TTC TCT TAT C8& 
ATG GTG TTT TAC TCT CTA. CTC TTC TTC ACT TTT CTT CTC CGA TTC GTC TTT GBT 
CTC TCC. ACC. GTT GAT ACT.ATQ GAC GGC GAT CCC TCT CCT TGC TCC TCT CTT GG^ 
TGC TTG GGG AAA AGA CTA AA6 CCA AAG CTT TTA GGA AGA AGG GTT GAT TCT GCaP 
AAT QTT CCA GAA GCT ATG TAC CAA GTT TTA GAA CAG CCT TTA AGC GAA CAA ' Gm 
CTC AAA GGA AGA TCA GAT ATA CCT CAA ACA CTT C!AA GAT TTC ATG TCT GAA GBET 
AAA AGA AGC AAA TCA GAC GCA AGA GAA TTT* QCT CAA AAG CTA AAA GAA ATG GfflS 
ACA TTG ATG GAA CAG AGA ACA AGA ACG .GCT AAG ATT CAA GAG TAT TTA TAT Ciffi 
CAT GTC GCA TCA AGC AGC ATA CCG AAA CAA CTT CAC TGT T^ A GCT CTT AAA Cm 
GCC AAC GAA CAC TCG ATA AAC GCA GQG GCG, CGT CTC CAG CTT CCA GAA GCT Gffi 
CTT GTC CCT ATG TTG GTA GAC AAC AAC TAC TTT tAC. TTT GTC TTG GCT TCA GMT 
AAT- ATT. CTT GCA GCT. TCG GTT GTG GCT AAG TCG TTG GTT CAA AAT GCT TTA Affi 
CCT CAT AAG ATC. GTT CTT CAC ATC ATA ACG GAT AGG AAA ACT TAT TTC CCA ATO 
CAA GCT TGGJTTC TCA. TTG CAT CCT CTG TCT CCA GCA ATA ATT GAG GTC AAG GOP 
TTG CAT GAT TTC GAT TGG TTA TCG AAA GGT AAA GTA CCC GTT TTG. GAA GCT A^ 
GAG" AAA QAT CAG- AGA GTG AGG TCT CAA TTC AGA GGT; GGA TCA TCG GTT ATT GTO 
GCT AAT AAC AAA GAG. AAC CCG GTT GTT GTT' GCT GCT AAG TTA CAA GCT CTC AGC 
CCT AAA TAC AAC TCC TTG ATG AAT CAC/ ATC CGT ATT CAT CTA CCA GAG TTG TiaT- 
CCA AGC* TTA AAC AAG GTT .GTG TTT CTA -GAC GAT GAC ATT GTG ATC CAA ACT GM 
CTT TCA CCT CTT TOG GAC ATT GAG ATG AAT GGA'AAA GTA AAT GGA GCA GTG QJ& 
*ACA TGT AGA GGA GAA GAC AAG TTT GTG ATG TCA*AAG AAG TTC AAG AGT TAC CTC 
. AAO TTC TCG AAT CCG ACA. ATT GCC AAA AAC TTC. AAT CCA GAG GAA TGT GCA TGB 
GCT TAT GGA - ATG AAT GTT TTC GAC CTA GCG GCT TGG AGG AGG ACT AAC ATA AGC 
TCC' ACT TAC TAT CAT TGG CTT GAC GAG AAC TTA AAA TCA GAC CTG AGT TTG TOS 
CAG CTG GGA ACT -TTG CCT CCT GGG CTG. ATT- GCT, TTC CAC GGT CAT GTC CAA ACC 
ATA GAT CCG TTC TC36 CAT ATG CTT. GGT CTC GGA TAC CAA GAG ACC ACG AGC Ti^ 
GCC GAT GCT. GAA AGT .GCC GCT GTT GTT CAT TTC AAT GGA . AGA GCT AAG CCT TGC 
CTG- GAT ATA GCA TTT CQT CAT CTA CQT CCT CTC TGG* GCT AAG TAT CTT GAT T<-3r 
TCT GAC AGA TTT ATC AAG AGC TGT ' CAC ATT AGA GCA TCA. TGA. 



Amino Acid Sequence of Sequence #15: 
Genebank Xm NP_200280 * . . 

Positions 1-535. 

MQLHISPSIiRHVTVVTOKGLREFIKVKVGSRRFSyp^EVFYSLLFFTFLLRFVF^ 
ACLGKRLKPKLLGRRVDSGNVPEAMYQVLEQPLSEiQELKGRSDIPQTLQDFMSEV 
MVT£iM£QRTRTAKXQEVI.YRHVASSSXPKQIfHCLALK]:J\NEHSIKA^^ 
ASDNiliAASWAKSLVQNarJlPHK2;VLHIITDRKTYFPMQAWFSLHPLSPAIIEVK7^ 
LEAMEKDQRVRSQFRGGSSVIVANNKENPVVVAAKLQALSPKYl^^ 
VIQTDDSPIiWDIDMNGK^GAVETCRGEDKFVMSKKFKSYLNFSNPTIAKl^ 
' RRTNISSTYYHlrn^DENUCSDLSriWQLGTIiPPGLIAFHGHVQTIDPFWH^ 
NGRAKPWIjDIAFPHLRPEHAKYLDSSpRPIKSCHIRAS. 
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